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The appli cation i s a o« «- • 
5 S6rial No - 0«/ 6 3 9/255 , filj^\ i0n - in - part ° f -PPlication 

incorporated by reference i„ T ' 1996 ' Which is hereby 

m their entirety. 



« present i^JSSj'^f 11 ™ 

drug discovery. More particular" V to 
a system for preserving the oen Mention relates to 

^od or promising sources L T ° f 0rganisms that are 
^netic materials from one or * randonl y combining 

» generate novel metabolic oa^H SP6CieS ° f to 

screening such genetically 22!' T *" P— reening or 
generation of novel biochL . f ° r the 

of novel classes of compounds *"* ±h * Paction 

metabolic pathways can have ut iliT T* 1 " "^tituted 
20 of the compounds. Uty ln commercial production 

SOIffiSSgop DRUf . 



^e basic chal l^t^^^^ 
25 lead CO »Pound with the desirable « t0 id ^ify a 

lead compound to meet the t ^ t0 °P tim "e 

with further drug development oT" t0 Pr ° Ceed 

discovery involves presentina'n, COnUn ° n approac « to drug 

causing a disease (diseased maCr ° no1 ^^ implicated in 
30 potential drug candidates arlT^ *" bi ° assa ^ i« which 
activity, such molecules cLid ^ 

transcription factors. receptors, enzymes or 

Another approach involves « 
organisms that are represent' • ^ Wh ° le Cells °r 

" ~- such a ge P nt e l a ^ "if -usative agent of 

lln «- include bacteria and tumor cell 

Traditionally, there are two « 
candidates, collections of na tul, " ° f P ° tent i^ drug 

natural products and synthetic 
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chapel,. Identified 

achieved by random °' 0 °»P°>">d. has be e„ 

encompass as broad a ra™. . SUOh coll «<=«ons whioh 
»» recent ^.■^"SV^^"*" 1 U — 

5 "U> further increTs. ' —"""""-a <*emical 

compounds available for . J! the m ~ ber a "° variety of 
•ny synthetic chemical librarvT", < H ° V " V ' r ' the «*— It, in 
Pagination and . UlJ , of ""^ t0 hU » an 

Random screeni n „ V aynthesrs. 

" " '««»*ri.l bacteria "CT — ~ «* 

"suited in the discover eT::'"^''"^ and Plants bas 
•1- 1.91, critical Rev Biotech"!? franco at 

«• "89, i„ . WerobUl "'" 0h " 01 "•»»-««, Goodfell„„ et 

diversity Press, pp. 3 \ «»«—.■. C.„brid ge 
» « = 30 9 - 406; Su „ neM 3 » B^y 1974 , Mv Appi 

Karm. Organisms, o. 0 . Fautin " Radical Importance of 
sciences, pages 15l- 1571 „ ' Cal "°rnia Academy of 
Products are biological [ y tha " ".»oo of t,, e8e n „ tural 

are currently i„ use , s J^*"* " a « la.st loo of these 

" ~ — The success of tbi 0 :' and anti- 

drscovery depends heavilv on t approach of drug 

acreening program. rypi„,T """' y °°'"P°«"<'a enter , 
a=reen compound collections co f tor ~°~"oal companies 
« natural and synthetic " T! 1 " 1 " 9 *— - '"ousands 

" "° Vel to P«viously. diacov °^; U c " a8 - »™, the ratio ot 

in screens for a„ti-" nc e »i<=h 
° f 0,6 »i=robial species » hi c . " 9e " tS ' 

»-M compounds that are a^re. ' ™ ""^cxi, active may 
« due to the difficulties T ""'""arised. partly, this 
" "nding. reproducing ,nd"uool — Magnate y 

aamples. since biological oL ' "° V " " at "« l Product 
underlying moleoular Z^J"'*^ " largely due to 

""logical diversity i„ t h. iS lnsu " lci ant 

random screening, „ hioh ^ZTT" ""'"^ *« 
compounds will be isolated " tne Probability that 

novel 

Novel bioactivitv h= 
— aources. V Tj^jT ~ 
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Enthnobotany and the search fo, 
Chichester. pl78-l 9 6, F Z I drUgS " Wil6y ' 

explored systematically an* ?! sources have been 

For example, it has b ' ' "° r ° U9hly f ° r n ° Vel leads. 
S species have been studil ^ °" ly 5000 P la ^ 

«. ^is is a -nT^^ 1 ^ *~ P ° SSible 
250,000-3,000,000 speci es estimated total of 

(Abelson i 99 o, science ° f ^ in tropics 

estimated millions of sn ^-" Moreover, out of the 

« a small number have been °! marlne nicroo ^anisms, only 
tremendous biodiversi^^ is 
lead compounds. reMlnB ""tapped as sources of 

p^tT^^Lri^nr'r 1 ™' fungi ' inverteb «t es and 

IS products. However, aDa ' , U8e * 33 Sources of natural 

organisms, such as the JL^ Well - st ^ied groups of 

developed for drug s«ee^ ^ bee " 

reproducibility and productLn^ T" 6 "" 1 Production, 
example, the antitumor 2lT\ ** ^ ** iSt ' P ~ 

« bark of mature Pacific yl ^ * const ituent of the 

clinical agent has caused itS SUPPly as * 

ecological system. Tax £ 3b ° Ut to the local 

2048 possible diasterJT ° ntainS 11 chi «l centers with 
s^thesis on . a 0^^."°^ ^ *° ^ - - 

« 1994, Trans Royal Soc Trop Me UnIiKely ^"P-on, 

Marine invertebrates a re a T " ^ ^""J- 
compounds but there exi.* • pronusin 9 s °«ce of novel 

^r conducting drug screens an" *" ^ 

instance, marine inverted*/ la rge-scale resuppiy. Por 

30 and many have seasonal variable * diffiCUlt to "collect, 
content. variability ln natural product 

Marine microorganisms are a Drom ,- • 
compounds but there also e*i J promis ^ source of novel 
technology for conducting drua in the 

35 fermentation with marine JZ M ind «-trial 

-rine -i-rodrg^^'^^'r- 

maintain in culture and m " collect, establish and 

' many have specialized nutrient 
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requirements. A rel 

xe liable sourro 
generally essential for- * unpolluted seavater is 

■* least 99% of ^ " is estimated that 
laboratory media. Furth e ™ SPeCleS d ° " 0t su ^ive on 

5 fermentation egui pinent * ermore ' available oommercial 

co„a itions . cr zzv,: ;:: B ;rr ai for use in 

Furthermore, certain ~ 
when specific o rganis :;7 n :~^- ln °»» 
environment. Pathogens ^"l^T T 
« trigger synthesis of corncob P " ,e " e Session and 

tobacco pl. nt WJcotiaJM s **f*; . P ° r example, the wild 

alkaloids when under attack tlT, ln0r "~" its synthesis o, 
Likewise fungi Cfm * " fro * la ™"e °« -Wuca ss«a. 

» or preventing their accumuia'irTT" 5 ""^"-ticn 
missed by traditional hioh «. metabolites win ^ 

evaluate a fungus toge^'witT'r' S ° ree " S ' " hi< = h "° «* 
example or the intlulnce ot the ^ P1 ° nt * *« «" 

organise is seen with the „„, tUral environment on an 

» aos. of the sodium ch.„„ei P a T n !"* ^ ""^ " ''thai 
oa„ be harvested by ruTbin, ZM'"' 10 "' """=~„. 
glandular back of a field IJt * WO " dart across the 

»e detected in second ^"0""' """^o* 1 " «»« -t 
Csly, 19S5 , Proc . 9 terr.riu»-re,red frogs 

« traditional crug screed' ! 92 = «-«). If only 

Potentially val^T I„ eTs^ ~ 
discovered. SUCh as thes e may never be 

Moreover a 3 ^ _j 
screening r.re ly becomesTdT «— * "ndorn 

30 selectivity. bio.vail.bu t y*or' f"" 

•o^ruate. Typically, a certaL ty ~» ** "e 

is required so that it „„ " Y ° f leM °°""P°»«> 

i»Pfov. its initial activity T " to 
synthesis and development of', ° W ' Ver ' current methods for 
" sources, especially plants „. "•"Pound, from natural 
There are significant obstacles ™ * ^ 

stages of drug development sul . - rlo » 

nt, such as recollection, growth of 
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the drug-producing oroani sm * 

in.proven.ent, „,edia ilH d&r ^ lic ^°-, strain 

^ t ^^^ t ^rrr ioB- marine ' botanicai — 

currently available Ttt\ * Underuse *- ™e 

compounds cannot be applied * Pr ° dUCln * and screening lead 
10 explored sources Unlfv effici *»tly to these under- 
*ungi, these drug-pr: du L e n SOme ^eria and 

-enable to ind Jt^;^'? 1 * ~ e ^ 
Simultaneously, the pre! " tac * TOl <*i«. 

drugs is intensified bv T^J™ flnding novel sources for 

general need for methods or h before, there is a 

and chemical diversit^f th ^ "~ tl0 

compounds for the purpose ITT " Untapped s ^ces of 

purpose of drug discovery. 

20 

is identified <e o „ „ e " e - once » "Olecular target 

r. 9 ui. ting the air.„;, r ne reoeptor invoi « d - 

« end/or .synthesis, therao.!! *" deel 9"sd to identity 

»oiec„ lar level wi j :\ r h T::: 9 c e : 9ents that interaot at * 

Gene expression librarioc 
investigate and produced" t0 
cloning has becoL a con^ \ * "oleomas. Expression 

3« target gene encoding a sin" ! 0 "" 1 "* ° btai " in9 ^ 

Protein. s physicai Properties ^ " ith ° Ut 

Many proteins identif u. 
libraries prepared fromh * SCreening *ene expression 

Potential LLsI ^ t ^ — "« are 

35 1994, Trends Pharmacol Scl Is 4 ' reC6Pt ° rS ^imonsen et al. 
<*rr opin Biotechnol 3.497 ll'S ' Nakayaffla et "92, 

Biotechnol, ? :735-74l and ^ 

' ' and "^al-transducing proteins 
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(Margolis et al., „s 5/ 434 064! 

Natl Acad Soi 84:3365-336 v ^ ^ ^ ' 1987 ' P ">c 

241:825-828; and Lin et al ™ laSakl al " 19 88, Science 
TGF-/3 receptor, f or examples of 2 ' °"" 68:775 ~ 785 ' (*W HI 
5 functional expression clonic, Pr ° teins ide "tified by 
Once a disease t manm,alian «lls. 

or engineered host cerrftW ide " tified ' the P~>tein target 
been used in biolo gical assavsT"" ^ tar * et have 

(Luyten et al. i 993f Trends J" *° SCreen for le ^d compounds 
10 within the scheme or d"! di c 11:247 " 5 ^ - 

expression libraries Zl\ *** USS ° f * e »* 

identification an ^ c ^ ^ "^ed to the 
targets. 0nl y i n thole 1^ P ° tent±al P ">tei» disease 

or small peptide, e " ant^oT ^ ^ ^ ±S 3 P " tei " 
15 been prepared in order'to a ^ eXpressio » libraries 

having the desirable biol JTT^ *"* f ° r molecules 

ciba H P oT ation synp ^SZiT 1 " 1 ** (Huse et al - » 91 ' 

librarrrs'^t^rr^nt 6 ! aPPliC9tions ot gene expression 
20 libraries of microo^^ * discover v- Gene 

Purpose of i.LtTf^na ~ ^ ^ *~ the 

Pa-ays that P^J^^*" ""synthetic 
specialty chemicals. These na^H metabolites and 

Proteins (specif icallv Pathwavs require multiple 

25 complexity than the elLT**™*** ' * ntailin * greater 
-r example, ^Ts^l^T » *"« 

synthases (PKSs) were idenf • ^ 4 

s of bacterial polyketide 

of the organism (MalparJda^ ^ SCree " ing * ene libraries 
Donadio et al. 1991 ***** * m1 ' 1984 ■ N *ture 309:462; 
30 multiple steps atlL^^^"" 7 ** 79 *' PKSS Cat *ly~ 
important class of thera^t 8 ° f P ° lyketidas < an 
structural diversity of « ^" °«'*°»^ and control the 
vector system in Stropton.ya/sTjlT^ Pr ° dUCed ' * bost- 
oirected mutation and exTr- ^eloped that allows 

35 (McDaniel et al 199 3 T ^ PKS * en « 

1-4, science 265 : Jo^T^ ' 1546 ' 155 ° ! "*° * ^ 
bas been used to develop more ef f i^n^ " hOSt - Vector astern 

ore efficient ways of producing 
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Polyketides, and to rations, , 

(Khosla et al, , wo 9 5/ ^ 548) . ^ pol >*«tides 

Another example is « 
indigo, by Mentation °* ^ t ~ til * 

5 containing the genes that ' h ° St ' *"> °P er °ns 

pathway have been genet icanvT ^ ^^V** ""synthetic 
Production of ind J 0 "-ipulated to ilnprove 

al- 1983, science 222 ^ 7 L ^ ** ^ host " ^nsley et 
Bio/Technology li.a"^, ' ***** * ^ 1993 < 
10 heterologous expression of « c °nventional studies of 

involve directed clonim, JIT** 3 -tabolio pathway 

stations, and rearr^LnToT" T^*' d " lTO * 
Proteins known to be invol"L ° ^ that * ncode 

metabolic pathways. nVOlVed in Piously characterized 

disease m^i^T^Tl^'T' ** Underst *nding of 

is an increasing need tot ZnTl ** the ~ 

for rapidly identifying lea " " Strat ^ ies «d methods 

toward cli„ ical testing. COn,pounds *nd channeling them 

20 

The present i* ^^^'^^imsmim 
- -era™^ « *- —very system 

Purpose of drug disoove ry Thl ** 
« captures and preserves In ' a k * ° f ^ inv *ntion 

libraries the g^^^JST" 1 * 1 - "~ SXPreSSi0n 
prospective sources of drug "e 1 ° rganiSms that «e known/or 

constru^^lt't the «- 
30 Varies from^^e ~ ~~ ~ 

^eluding microbes, plants °"? ° f d ° nor danisms 
cannot be recovered i„ substant^^ 18 ' eSPeCiall y th ose that 
cultured in the laboratory T in nature ' or *>• 

»ay be selected on the basis I ^h 0 " 0 "" ° rganisms in ^e pool 
35 properties, or they may be ! kn ° Wn biolo 9ical 

unidentified species of "^"^ ° f k "° Wn a nd/or 

Random fragments of the IIT^ 8 C ° llected f «m nature. 

the genomes of donor organisms, some of 
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which contain enti^ ^ • 

thereof, are cloneTand" 1091 ° C P ° rtions 

According to lTlZT^ ** hOSt 

5 host organism. The natural * ° f f » nc ^™^9 in the 

organise may thus be rJ^^T^^ ° f donor 

The expression of dono genes Tt'h ^ 

and regulatory environment or \ T *^ Slmil * r Physiological 

otherwise silent metabolic oathw^ ter ° l0g ° US host «« unmask 

of the donor .organism may VlTTT' ^ metab ° lic ^ways 

Pathways resident i„ host * metabol ~ 

compounds or compounds not 0rganiSm ^ 9enSrate novel 
organism. nor » a ^y produced by the host 

XS organL"; iTe^es^ 9 ~ -nor 

time, the system can rend S " ^ ^ «- 
easier to detect against an If" 01 ' 0 *"* ™ s 

biochemical/cellulL bal * Characte ^ed 

Essentially tllZJl ^ ° f ^ h ° St 

*o are captured and preser^Tt^" °* ^ 

which can be -replicated 9606 ex P ressi °n libraries 

discovery programs " * different drug 

In another embodiment *-h- • 
construction of combing ch " lnV ° 1VeS ^ 

« libraries in which genet" ! t " ^ ress ^ 
species of donor organism J " t3Nm ° ne ~ -ore 

expressed i„ the noTT^I J"*^ Cl ° ned ' and 

random combinations of '2 ^ libr ~ ies ^-rate 

organisms, which gives rise tl ^ " Ultiple P athw *ys and 
30 discrete gene set! prevW, metabolic Pathways and 

term -discrete ^^7^ in — The 
»ore genes obtained from the T aSSembla * e of two or 

*ore pathway: or oroani s » < Ration of genes from one or 
library. The plu^tTof * expression 
3S functioning i/the host ^"T « •* 

novel, chimeric metabolic P IZZ ThaT "7 t0 f °™ 

of compounds. Thus, the diversity of T ° laSSeS 

iversity of molecular structures 
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available for drug screening < • 

genetic material of the ev* . lnCreased ™i*ing the 
combinatorial chimeric gJ^ZoT^^ *"* ° rganisn > s ±n the 
Xn yet another embodLen """^ 
5 combinatorial expression liberie ™* nti ° n inv °lves biased 
material in the libraries are or! 7 " d ° nor ^ 

contain the entire genome oft^T ^ " 0t 
invention also provides mo J i , * ° rganisins - The 

libraries in which ^ ^T/^ expression 
10 transferred from one or rZl sT" Can be 

least one other species of k T ° f ^ ° rga ^» to at 

While standard methods ; ° r9aniSm - 
libraries can be usah 

or screening gene expression 
to incorporate a reporter r l ^ ^ fUrther »^ifi ed 

» that are expressing the des^T tail ° red ^ 

Products. In a specific " L * ^ M ^^l ic 
engineered to incLe g^^^T ' ^ ^ °™- « 
operatives associated w ath a " ' reP ° rter Pr ° tein 

responds to the desirable l s T^ 0 " 5 " 6 Pr ° mot *r that 
« in the expression library. ^tabolites to be detected 

exposed t" TtZTolZ.lTt^T ' ^ ^st organism may be 
reporter molecule that is ^ * rsor of a 

the reporter molecule by a lT^ direc tly or indirectly to 
« -ught. Activation of JL^T"" Pr ° dUCed in th * 
conversion of a reporter or- °* ^ or 

allows for identification InT^T Pr ° duces a s ^"al that 
clones. °" and eolation of the desirable 

In yet another embodiment of *-h • 
30 organisms in the library JTt -mention, the host 

matrix with a reporter re"Z ******** *» a -mi-solid 
that contains an assay o ^tL'T 0611 ^ 

desirable compound, e . g " * *" 9 target for **• 
oancer cells for antitumor ^ ^-"^tives, or 
3S Processes can be use ° *igh-throughput screening 

fluorescence activated celi ' T^ 161 

cell sorting or magnetic activated 
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cell sorting, to identic 

in a combi„ a t orial gene **" d iS ° late the ^sired organisms 

The positive cL * PreSS1 ° n library. 
Production of novel coml^ ^ *~ <*• 

5 ° f the ^tabolic pathway ^J^ 09 *"* ^chemistry 

compounds may be delineated by cha , Pr ° dUCti ™ of the novel 
material that was introduce, Lto T *** 9enetic 

The present invention iS ° lated Cl ° nes « 

vectors useful for construct^ 0 relateS t0 reco » b i«ant dna 
10 ies, specific a^^^T^^^ 1 expression 

host organisms containing a J !* ^ 6Xpression libraries, 
system, host organism Particular type of reporter 
~ otherwise Jic ^U^. 1 Paction 
host organisms, indicator U ^-tions comprising 

cells and/or a reporter regimen. 



15 

3 ' 1 ' BSEimilQSS 



As used herein *-h J ^^^2^ 
meanings indicated. ' f ° llowi »* terms will have the 

A "combinatorial natur-ai 
*o a library of expression const^c^ 6XPreSSi ° n is 
material derived from a plu ra ^v *** genetiC 

organisms, in which genes preset L tT^ °* 
operably associated with reaul^ gSnetiC materi al are 

expression of the genes inln^ r6gi ° nS ^ drive 
" -^-atorial expression liZllZT,^ ^ The 
are capable of producing function! [ ^ ^hat 

donor organisms. The gene^ ^ products °f the 

organism encodes naturallv ~ material in ea ch of the host 
Portions thereof fro m one of bi ° Chemical Pathways or 

A "combinatorial chW, organisms, 
a library of expression constructsT" M * r ~ loB libr -y» is 
concatenated genetic material T Preparea <*om randomly 
^-ies of donor organisms ^/^cT ^ ~ " "~ 
genetic material are operablv 9-n ~ present in the 

35 regions that drive expres aSS ° Ciated w "h regulatory 

host organism. The host or^n^ ^ ^"^ *" M a PP">priate 
producing functional a* n 9aniSms used are capable of 
.ctxonal gene products of the d«»o>- 

r cne oonor organisms. 
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materia! der^ed tr ™ """""a »«p«.d f r„ g. n . tic 

which has been Preselec'teVrorTs*^ " 
5 pre..lected genetic materi. can f*"' 10 ""Party. The 
combinatorial natural patnC" iTLtSST 

materia! derive fr~ " C ° nStrUCts ""pared *«- 9enetic 
10 and cloned in a shuttle ! °! *"* ° f a ° nor °rg.nieme, 

the donor ^T^TZJ^lT^'" ^ " 
strain o£ donor organism ro at Teast IT" " 
strain of host „„,.,„ . ' one ° thei: "Pscies or 

Prepare ~«>inZlZZ ta lTT ~ °" "* — to 

»s used herein ~! ! """""^ ° r ° hl "* ri <= "»r.ries. 
constructs or host ^i lb " ry " t0 session 

constructs. ""amsms containing the egression 

■^iTS^rtSr P " h "-"< "™ Pathway and 

» "ochemica/reaetL TZTll'"" °* 

such pathways may inclu £ „ \" ° ar " et ' out * an organism. 

or biod.gr.Llv. Path' s o/^:" """^ *" 

or conversion. ' P" 1 *""^ « energy generation 

« produ: t "oTa p :r uie th " is «- — »* - 

- Iht ra tion ."^STS ^ f ~ ~ 

.-JLt^ot'iiTaLT"' inVen "° n - *- f °"°" i "' 
(-Uli-oxar,, „ M ( »icror„i. r 7. ( ^r.T )f , <M ° lar ' ' * 
(»illi»ol.s, ; „mol (micrcol.:, n^w "" 0leS,; "™ 01 

35 (kilograms) ; ~ ror ,„ =1 ' ' ™ o1 (nanomoles) ; kg 

- (nanograms It." »? « <»*cro,ra„s, , ng 

Vd (voices); s * .microliters,; 

as; , and -c (degrees Centigrade) . 
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P.ci tl c S ea„ater ; SSB: m ^£?ZZT>~.'" — 

5 fluorescence-activated cell • 

green fluorescent protean Z k^'k ViCt ° ria 
rp» = Rotations p.'^ "^T^f' »« «™ 

phosphstass; EDTA* Ethv, ' „ intestinal alkaline 

Trie/ 1.5 «, ^^"1 * ln ° tetr " C " iC ™< 10n„ 

" =sr ££rr rr r — 

artificiai c/^o^e; «». W, yea.t 

cau2if iove2 . fl;0saic vi ;J 1 ^ e «* e x-eactio*; CaM V: 

soa± um dodecy2 sulfate ™' EBV - *Ps*e-»-B*rr virus; SDS: 

ace ' CsCl: cesium chloride. 
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4 



MSCRIPTTOt, ^ rTn 

contains vector DBS and T* Th « «*Pressio„ construct 

» genes encoding Metallic 11 -"l— 
regulatory region. Pathway ana natively associated 

construct contains vector dha and f I «*~~ i« 

Pathway^ ss L Cl r ; r "! ry St ~ *~ conoinatori.1 natural 
fro. donor organise ^3 partial T """^ 
restriction enzy„e to generate L " * 
« encoding natural ly-occurrlno L k ^""^ < C > 

thereof. A p Ni vLtoHm J- """""^ ° r P°™<™ 

to generate a vector LIT 3 ""'^ion enay»e 

« vector having cospatible ends (., i. lifted to 
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the fragments of genomoic dna to * m 

(F) . UNA to for ™ expression constructs 

Figures 4A-4C: Assemble * 
depicts an annealed, phosnh ? / 9606 Cassette - Figure 4 A 
S containing a cohesive Si sit! V" **~ 
corresponding to a portion T ' blUnt ^ 

a promoter dimer containing 0 , I **** P±9Ure 48 de P ict * 

* * lac propter. "pTct^ °" — » 

fragments. depicts concatenated promoter 

10 Figures 5A-5P: Clonic 4. , 

steps in preparing pro-oter * . P " U " » 
directionai cloning „ f °^ *«™n.tor ^ents £ or 
SB shows tha steps in Jen. 9en " M1C DNA inse "=- « gU re 

is fragments for ligation I * Pr ° m0ter ,nd *«»*~tor 

shows the steps L p re B a° 9Sn °" iC "* inSe " 6 - « 
cloning, asse^My of genl CD " A ,<>r di "oti„„al 

support. Pignre 5D 8 Zs t\7.T " ' ■ ^ Btt *° h ™»* *° 
inserts for oloning ,„„ t , PS ln P re P a ^ing geno»io 0HA 

- -acnment to soli^™' ^ ^iTll « 

serial ligation and deprotection T Sh ° W the 

concatemer, the ligation o thT ^ CaSS6ttes to *«■ a 
eoli shuttle vector (pDhlet, ^ * " P °^ e/ ^ 

construct from the solid ° f the e *P"^i°n 

« expression construct ^ *"* • ta ^i»tio» of the 

-^^l^;^ — « - Paring 

-p of Streptocos Z c " libraries ' Fi ^e 6A shows a 

unigue BamHX site 'f landed ^3 ZTl^^ °° BtaiM * 
30 multiple cloning site th/ . Promoters in the 

thiostrepton relive* ° rigin ° f and 
(ori,, an m^ZTZ^ * C ° 1El 

— a map of mod^LT^ ~ ^ ^ " 

Edified in the multiple clon^n ! PDbl * t ±S 

3S com origin of replication ^ ' M C ° ntains * 

Copies of autonomous re^:c;tIn" amPiCillin *~ W) ' ^ 
™arKer, and the /8-galactosId! SSqUence < ARS > ■ an ura4 
P galactosxdease gene (LacZ) . A: Aatll- 
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BanHI N: Ndel. p igure 6C 

altered BstXI sequence an/ T ° lig0mer containing an 

excess to ^acI /W I tI cut »L fT WhiCh *" 

Figure 7 shows T ch ^ m ° dified PDblet ' 

5 co n p rising a reporter ^ m ° reSP ° nsive construct p ERD - 20 - GF p 

(GFP) , a che,ore S po; si : e ^:: C ; din9 ^ 

regulator (Xyls, . ™ t6r < Pm > and if associated 

""tri^TUiohTs e a n lTT OPlet C °»' ,risi '" • P^eable 
« gene expression library f",r t<!< \* ° l0 " e fro " " oo„bi„atorial 
• reporter r. g i Mn . *' lndi o"or "U „ hich contains 

a pojrr :oi an :^™ r - — — - 

expression constructs coeprisi """^ t "* ' ° f 

" coir, strain ^ co^ini' -""k ^ '■ 

PHKD-.o-crP, referred 1? » ^ °°" Struot 

cos-id library of »h» ! X "-=PP was infected with a 

with or without LTZT,' TW" ^ 

" hours at 30-c. Tnl su^ectd : o t 9e " eS ***** f ~ 

« sortino. Pierre «. v L1 !" . t "° ° yCles of **«> 
«*ure ,b: octroi ^^JT^"' ""^ 

of »=t^:od°n^dr«e ai :nr of the amin ° 

Predicted partiai acid «- the 

« Plain boxes indicate S LT SegUence d ^ iv ed from cxc-AMMo. 

indicate conserv,":. s!, u T T*' 

sequence horooloqy. 
Figure 3 1 • PCR de+* 4- ■ 

Pools of genome DNA of LrLVb i„ 
stained agarose gel a 
30 marine bacteria genomic DNA „ * mpliC ° nS deriv ^ 
sizes in op. n^tivT^i ^ 

the amplicon and for ribo* T + *' positive controls for 

a^plicons derived fro^T ^ ^ ^ C ° ntai " 

marine bacteria; x " T ! * ^ 311 37 s P ec ies of 

35 bacteria. ' ' P °° ls of gnomic DNA of marine 

Figure 12A-C. pgr detection ! 
in genomic DNA of marine baited CXC-AMN2 0 sequences 

rxne bacteria species. The figures show 
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stained agarose gels cont • 

genomic DNA of individual ai " lng ^ araplicons derived from 
molecular weight Wark ^ «**~ — eria. „ : 

+: positive controls for tho f "* negative control. 

S The lanes contain a.plicons deXTf^"^ rib ° S ° mal ~" 
marine bacteria; species *t , I genome DNA of 

2 and 3 respectively. ' 2 ~ 2 ° a " d #21 " 35 in P°ol 1, 

elements^ the sT^T"' fl9Ur6 Sh ° WS the ^ 

XO P PCos + ura : ttu i tlple o!^^^ ~ sector 
marker (ura4> , cos sitea , 9 (MCS) ' ^ east selection 

o-gin of replication XlZjTT* " ^ ^ ' ™ 
(Neo«), colEl origin of repi icat ' ^ resistance «•»• 

resistance gene (Lp«, a L c ^ (C ° lE1 ° ri) ' a ™P-illin 

« sequence (ARS, . PM,b " aut °nomously replicating 

of thlTr^ 4 ;. ^. fi9Ure Sh ° WS the elements 

multiple cloning TiT ( «c S ° " ~ t ° r PPC ° Sl : 

cos sites for packaging^ pfcT <™«> • 

*° repl ication (SV40 J 9 " ^ (cos,, SV40 origin of 

ori, # ampici^iin resistance » ° ri91n V replication (CoUl 
autonomously ^1^^^ ™~ 

that provides methods and composition^^ " W - 
Preserving the diversity of alZt during and 

gene products; of multinio com Pnse random assortments of 

« allowed to i n +" SP ^ S Which «• in some cases 

CIdCC with each othpr 4.w 
and result in. some cases in , ^ e * press ion host, 

biochemical pathways "nd/ 1 forma tion of novel 

nways and/or the production of novel classes 
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of compounds. Moreover fhoiw 

provide efficient access t rarieS ° f the invention 

molecular diversity Sone ° ° therwise inaccessible sources of 
can be transferred from 1° libraries of the invention 

substance, .aaition « chel * ^ rUCtUral -""""""on « - 
decomposition of ^ .^"^ to the subst.no., or 

10 The novel classes of ™» 

U-ited to ™tabolit.s ' T ~ y in ° 1Ude but are not 
structural component of !„ ^ . ""^"tes. enzynes. or 

«y have one or .ore potenriaTtn 1 "' * °°" POUnd °* inte ~ =t 
including but not llJtea to , ""tapeutic Preppies. 
» antitumor. phar^o^l or T ' a "" Vi " 1 ' 

■» other co^clal ly ! v . lu J'/'7 n0n,0dU1 »ti'>» Properties or 
compound nay serve as 2 . ° h ™ loals »* Pigments, a 

- —Ptor or a ^27^1'" ~ ' 

» .conbinXlrgot ^ ot-b" 0 "' ^ ^ 

combinatory; natural^!!! libr °™" ""compasses 

combinatorial c„Wi c paT" lite «y. 
host organises o^J^.™' 1 -' - -11 as 

constructs. libraries of expression 

A "combinatorial nat-..>^i 
library of expression P * session library is 

-terial aerivS t "n "„ trU ° tS PrePa " d *"» 
in which genes present "I ZJZ' T"'" *°"° r »»""—. 
associated with regulatory rL "'" eriia ° P * rabl >' 

30 the genes in an .p^opri«e h'T session of 

expression library utm'es ho * ° r ' an "'"- The <=°»»inatorial 
ot proaucing functional"!! * or 9" ls »- that are capable 
The genetic material '„ la! Pr ° dUCtS °* """^ 
naturally-occurring bio ! " ""^ "^""^ 
3S fro. one y of « P—ns thereof 

° nStructs Prepared from randomly 
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concatenated oenpf^ 

species of dJL^^V ****** f "» a Polity of 
.enetic material J^^" Whi <* P—t in the 

regions that drives e*»T ass °"ated with regulatory 

S host organism. Th I ~~ * « appropriate 

producing functional o.n 9aniSas Used are capable of 
Upon expression L the 9 h n : st Pr ° dUCtS °* ^ *— 
donor organises) may intera^T 1 *"' 96,16 Pr ° ducts «* the 
biochemical pathways ° n ° VeI chimeric 

*• Generally, the methods of *k , 

P-viding genetic materiel Z frToT" 
organism (s) , manipulating sail ! " d ° nor 

introducing said genetic » ! •* ^ * ateria1 ' a "<* 
zoning or ^^T^^^ « «-* organism via . 
15 d °n°r organism(s) are clon ed V " m ° re genes of the 

organism. such host oroa" e *P"**ed in the host 

material are poo^d to L" " d °~ * e " etic 

the genetic material can b! ! M - lib "ries, 

host organism to another slT"*^** m Species of 

« in which the genetic mate^a I ZX °* ^ ° r * anis »' 

expressed. Depending on the st l * Maintained and 

host organisms and victors Starting ™ ate ^al, choice of 
known in the art can be » reC ° mbinant D ™ techniques 

with the techniques and IrlL , C ° mbinati ° n and any order 
» -.4. 5,5, an, those *" ^ 

The cloned genetic mater'aT 111 "Tf ? * 
random assortment of genes tyP1C3lly «»pr±.e. a 

and controlled by one or ZJZ^ZTT* °* driVe " 
The expression construct Tr vect " ^ulatory regions. 

30 regulatory regions. The a en *** Pr ° Vide so » e of these 

transcribed, translated and ^ d ° n ° r or 9anism(s) are 

Produce functional protexns *" ^ h ~ t *o 

metabolites of interest " ^ 96nerate the 

searches for donor multi-enzyme systems 
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responsible for maki«« 

interest. Genes toac °°*™<«* or providing activities of 
biochemical p,tow, y c "' lnV01 - d ■ Particular 
characterised in , si" ""^"^ end 

5 typical ara ^r :r res,jo " construct ° r Cl °" e - » 

in Figure i. ° h construct is shown 

this ^tziiizt* °i °™ d is 

Oruo development e«~t. " faeilitate downstream 

1. Process development. Z " T" — 

under standard conditio^ f° S " 1Ve «"> "e cultured 

substantial ^JsT.T^TtZuT '"^ °™ 
of the biochemical pathway are .f - The ««~ 

fencing, Butatlo ^ 1 — «*«ly available for 

» screening. The cloned h f T ' ^""^ """^ <* 

amenable to t^,^^ 

overproduction STT^ ~ ~— «~ ~ 

S iient"" h u e „ r rt r :cta b :r:: i ::: r™— « — 

20 discovered more easily b " virt . "* 
reconstitution in to. „ . th<Sir functi °»al 

characteristL of the h°: s t° r9aniSB - ^ «*•— i-1 

deviations as a ree^t of ^ *" "* kn °™< 
■aterial can JLTiZ l 'T '" " 
» aetected by comparing" "I™ hosT ^ 

donor genetic material agains! org.n^m containing 

<=o be produced by the control h T °™*"'" Mt ,0 ~" 

of environsental condi"! Zn ' ^ "* 

desirable activffv "one. Even very low levels of a 

" blochemi^rand ceilulaTrr " h °" «- »°" 

».U characterised .s d ^ 01 "" 1 °' ^ h ° 5t i- 
Ptesent invention pro^ritol ^ ^ SeCti °" 6 ' 
isolating clones tL produce l , dete ° ti '" 
of compounds. de "«>ble activity or class 

>5 In a preferred embodimenh ti 

donor organises, that ct2Ti be applied to 

amounts in nJaL/Tj^SL* *" SUbStantial 

ltUred ln the laboratory. B y cloning 
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genetic material fr 

the organisms. rn^ZllTLZT^ ±Bt0 3 h ° St danism, 
their products tested efff! ^ ^ r «°*»o««. and 
Properties. Thus *T • ffl ««*ly for any desirable 

S is captured and preserveT^ 0 diVerSlty of organisms 
In another embodiment of <-k • 
chimeric pathway gene expression Vr"'""' 9 COmbinat ~ial 
in which the genetic ma^rlaL™ ^ *** Const ™cted 

organisms are randomly oon t ^ ° r multi Ple donor 

xo into the host organTsr.TuTet: hT * 

library may individually contain a ° r9aniSln *" the 

of genes derived from the ^ "nique, random combination 

organisms. Figure , * VariOUS dono ^ Pathways or 
regulatory regions ln of genes and 

XS combinatorial chimeric paXy'T C ° nStrUCt °' 9 

the most part, SU ch combi^on f e * PreSSi ° n lib ~~y. For 
not occur in nature. Up n 1 ^ library do 

products of the various dl"' *** functi °«al gene 

with each other in individual host^ T inte " ct 
20 combinations of biochemi^ danisms to generate 

chimeric metabolic |£f ^ ~» " *» —1 

compounds. Collective^ I Paction of novel 

organisms in the l ibra J ' J** * eneti * resources of the donor 

chemical compounds that may not "eT^ ^ " diVersit ^ <* 
25 organisms. * * be found individual donor 

In another aspect of 
organisms may be selected on .k"^^ 0 "' ^ SP * CieS of ^ 
characteristics, or abiliJ \ ° f their biol °9ioal 

uncharacteri.ed biochemi-f ° Ut desira *>le hut 

30 to the host organism " £££ ™ Elementary 

include, but are not limited to t h -Y 
certain nutrients, to survive un^ *° UtiliZe 

Privatize a chemical struck C ° nditions ' to 

down or catalyze formation of certalT """^ ** ***** 
35 linkages, when genes of the don ° f ChGnic ^ 

the host organism, the donl or ^"ism are expressed in 

substitute the functus of ho'T ^ ™«<" 

ns of host gene products that 
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ZT^ltlZTs "'XT PaUWayS < *™ «»* novel 
produced by hybrid aCtiVitieS founds may be 

components ^Tta-et I -~ and host-derived 

S gene products ly ^ by — 

Alternatively tL ! ^ h ° St or * ani *»- 

by products of h^ter^:" 6 ^ 0110 be ™* d 

have been genet i^Uy 2 *** endo *«»ous or 

Prior to or c^nteloLn ^ ***** h ° St °^ ani - 

XO expression 1^™^° ° B ~ tl " « **• g ene 
embodies constructing and' Pr6Sent inVenti *» 

libraries, wherein DN A TrZlT^ T e ^ essi °" 
of donor organisms are ci^ 17"°° * metab ° liC 
organisms cortainina T ! coexpressed in host 

« m ano r 9 ^ metabol i<= Pathway. 

organ^m^vrre^; 1 ^ "» ^ 

e«lux systems which sec'e^s °* ^ 

the culture medium, thul rel C ° m * ounds <* Merest into 

compounds to th* h T red ^mg the toxicity of the 

ao neuLl" sL mayt^eTd 8 :- ^ 
organisms, whereby metabolit * ° f th * *»t 

host organism may ^ ~ f ^ d by 

of the metabolites " eTO ' t * Md ' thus facilitating recovery 

« gene l^ltllT ^ZlrZl ^-^orial 
invention further provided LthL T^' Pr ~ Bt 
organisms in the iLrlry l h T det -ting those host 

compound of interest L P ° SSeSS aCtiVity or 

the host organism conta ns^^r 0 :^ " ^ inVenti °"' 

30 respond to the presence of an T ^ W±11 

Presence of J de bl -Produced change, such as the 

the de novo synthesis of " by bating 

embodiment the host \' VOKtn ^cul,. In anotner 

-porter molll or ^ ^ ~~ ° f * 

3S converted to the ; e r e tt 9 " 1 ^ * 

compound or activity tL PreSe " Ce ° f ^ desi "ble 

clone generates TsL T m ° leCUle in the * ositi - 

erates a sxgnal which allows detection of the 
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positive clone in the 

isolation from the oth^^ library ' as as its 

t» »- r non -Productive clones 

in many respects th ne8, 

significant convenience and dlSC ° Very S * ste » Provides 
5 steps of drug develop Vantage to the various 

libraries of the "Cent ^ *° CliniCal tr±alS - T »* 
established .ulti-Ce" fo^t 3 " *h. 
high-throughput JTST^ 
invention are oroani^ organisms of the 

10 manipulation and/o'Z'T^ '™ tl ° 

invention ta,es advan" ^ >™ 

organisms or production L I ^ SUCh host 

ot their Moi: g i c rrXe;tts are d Well " CharaCteriZed *» 

By cloning genetic materials L maint enance requirements. 

" ^ expression systems" t'he^n^T" 181 " *" ^ 

culturing conditions for tLZ '** dlfficul t 

Thus, the biological active ^ n °! h ° rganiSm is " d -ed. 
toxic properties of anv ml Paleokinetic and 

system of the invention IT^ 0 ^ discovered i" the 
2 0 effi ciently . " " ay be Studied *"d optimized more 

The novel metabolic pathway generated i„ 
clone can be delineated h« «. generated xn a positive 
biology. The lead co ^Z^"^ tec «nigues in molecular 
clone of the.drug- P r:: u P c T g ho^t by -during a 

» empirically determined culture T""" ~ 
quantities of the lead co S Condlti °ns, so that sufficient 
analysis and --JSLr^^^*- ^ 
manufacturing protocols such already high purity 

(GMP) established for SM " ^ ^'"turing Practice 

30 organisms, unlike conventicle^ indUStrial *ost 

Product sources, less Iffo^ ^ ° f Scree «^ natural 

screening and production I V* reqUired to ada Pt the 

requirements of JT^J^IT' *° 

tlal dru 9-producing organism. 

3S Moreover, once a postiive r»i 

screening assay, the sequences in identified i" a 

metabolic pathway or portions th enC ° ding the 

portxons thereof can be isolated and 
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used as hybridization probes D1 . h 

combinatorial gene exJL ge " e lib «ries or 

organise, or related or^nT" Ubraries of the donor 
Probes to isolate _ SCr * ened 

with such 

5 ° ther 9en es in the same op^ 5 ^ thC natUral Pathaway or 
expression consturct. JlT ^natorial gene 

reintroduced into and seQ ^nces may also be 

»aking gene ex pression °** P J SSSed in * host organism for 
and expression can be repeated^' /** ° f scree ^S 

*• genetic and chemical £££ ^ in ™ the 

expression libraries. combinatorial gene 

The present invention „i 
combinatorial gene J^Z^IJT^ 

methods of the invention fZ made -cording to the 

XS particular set of donoT 9en6tiC Materi *ls of a 

organisms or cell types TnTsT W ±mm ' ** ^ 

need to be individually iLLTiJ^t^ — 
Preparation of the combinatory^ Li ^"^^ to enable 
The invention further Zj, J e *P"ssion i ibraries . 
20 -»bi„atorial gene expression * rChiVal * ro ° bili ^ble 
material of the donor organism in Which the genetic 

species of hos t organism to alth tranSfe " ed **» one 

organism. These libraries ar SPeCi6S ^ Strain of *>* 

genetic material of the don partl oularly useful when 
» diff icult to 6 organs is unique , rare or 

expression of the donor Z n Z " d6Sirable to obtain 
secies or strains of Lst ora mteria:i *" ^ 
vector contains the appr^r 1 "* • ^ ^ Cl ° ning 
selection mechanism ^ ZLT ° rigin(S) ° f "plication and 
30 genetic material in the l"b' WXW(m > ° f transfer, the 
stably maintained J l^Z IT f! « b * 

of host organism. The JIn S !! ^ SP6CieS ° r drains 

example, isolation of thl ^ effect ed by, for 

introducing the constructs InT" 10 " COnStrUct s and 
35 such as but not limited ^T" ° r9aniSin ' by ^ 

transfection, infection or elect " 0rination ' 

or electroporation. Alternatively 
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the transfer can be effect 

appropriate host organisms. * baCterial Ration between 

Any combinatorial 
inve„ tion may be ampii.L l****** 1 ™ llb «^ the 
5 A»pl ification refe ^ ^ led ' "PliMfd, and stored 

containing dcnor dna so llTl^l T ^ or * an ^ 

organise are produced. RepJca, P Cl0 " eS ° f the h ° St 
growing of individual clones i ^ P±Cki ^ and 

gene expression library of * library * A combinatorial 

10 retrieved by any techniques tLZTtT 

appropriate for the hoJ the art that is 

the invention a^an Tf'T ThUS ' the Varies of 

preserving the genetic r^ ° f Capturin 9 »nd 

"•ay be accessed repeated^ 0 "^ ° f d ° n ° r ° rganisms < which 

Peatedly ln a drug discovery program 



15 

5.1 



5.1.1. 



DONOR QPGANT-fiMg 



Any organism can be a d^T"^^ 
Preparing a combinatorial c,!" *° r *** PUrpose of 

invention. The donor ^J^LT"**"*™ libra ^ of the 
or public laboratory culture fr0m privat e 

the American Type Culture ZlLllZTZ* f^*' SUCh 
25 Mycological Institute or ^ tl0n ' the ^ternational 

cultivable ox- uncultivable enVironme "tal samples either 

Plants. The donor organist bact eria, fungi and 

manipulated or genet^ T ^ tranS * enic ' 9-etically 
useful in generating rn^/l 1 :::^ ^ *een 

The donor organism Tm" 9 
current state-of-the-art li k * livable with 

genetic material used to ^ i!* 1 " 1 techni * ues the 
obtained directly f r l * P " the lib "ries can be 
Minority ( , , %) ^ the HZHl^l T*' *** ' 

cultured in the laboratory T Ca " be 

oratory, the major advantage of the 
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present invention is that th« „ 

be cultivable to be u t il? k organism does not have to 

APP1 Env Micro, 56 :782- 787 7 ^ ^ 

The invention is not limited to the use at 
5 microorganisms as donors Pi a „«- 

of compounds, some with draltl *" en ° rraous 

and microorganisms, c « " ° « ^ animals 

Science 2 47: 513) . SoiBe ^ Zs ' ^^^^ "50, 
wounding or elicitors derived Lr^r"* 8 indUC±ble * 

10 pathogens (Cramer et al Z R t Z ^ WallS ° f plant 

al. 1*85, science S^o! 4:285 " 289 ' — - 

Natl. Ac ad. sci. USA 85 6738 ! 6 4 7 3 .r" 6 '/ 1 : 1988 ' ^ 
compounds, l ike taxoi } * Blolo * ic ally-active 

examples o f S^IvT* T 601 "' "* « 
» undergoing clinical ^ 

and antimalarial agent! T "^-ely as anti-tumor 
Potential medicinal^rtieT ^h*' 6SPeCially ^ 

organisms (^-illipson, L 94 deSirabl * 

Suppl 1:S17 . £; chadwi ; k e 94 ' T "" S SoC Tr °P Med Hyg, 88 

20 the search for new druos" 1, ' *" " Ethno °otany and 

Symp 185) . 9 ' Wll6y ' C »iohester, ciba Foundation 

Another source of nai-nv a i 
useful antimicrobial or J" " 

invertebrate, and v. ~ «* Parties are 

« as chemical defenses T' ° f om " ,aMs serve 
Peters. ^Z^ZV^T^ P ^" S ™ 

— es a for, of coL"^*' * *° P " y » 

-.a sci 92:75 . 82) . In :::: nr 1 :,: 9 ::' t Nati 

metabolites are thought to h. ! secondary 
« that may he eymbiotio CFaul^r "tT ? 

"alien,, i 2J:301 - 307; It al \ 995 : °""" tt " Chi " i0a 

Symbiosis in Marine Natural o ., ' l " overvl ew of 

honor of Professor Cnio^LT *» 
canary , i as pt _ er „ - ^-J-ity, 

other^nw^attrrr" ^ - 

e.*. certain plants. s „ ch rcy";" 0 " °* int «~". 

as cycas, can produce an ecdysone- 
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mimic which disrupts ^ 

such organisms M fu^ i^e "* ""^ ±nSe ° tS - 
they exist as c * " Sa ° e ecological niche where 

5 genetic materials delved C 'L * d ™* a ">™ <=° — 
chemically „ lth others " ZZ"™ 1 *"' 

molecular tLt l:;" 1 ":.^ 0 ' 65 Pr ° dU ~ 

» octalactin A VhicHs a p tl J ^ntT "^"^ # * 
molecular structure no t potentl » 1 anti-cancer agent with a 

hacteri. ^TolZ e "^'.T * 
»3>; ana salinamides ^ ^ ^ 

116 = 757-758) which have potent „«, . ', " Che * S °° 
15 certain compound, derived " tcry properties, 

bromine fro. ...water ""Tr fc -™««»l— contain 

active hecause cf the chela TTZtZ^TT ■ 
halogen, e.g., „» rinone (PatM ™ « " ,LT ^^"^ 
Lett 33:7663-76««i , I'atnirana et al. 1992, Tetrahedron 

,0 mevalonic acid oiosyntLT 01 " — 

antihiotic activity r^nst ^ " hi ° h S<Sle0ti « 

» vast diversity of 9 "° P °" I " V ' '"".tie. There is 

hahitats. eroi ^a ™op S LT S ""^ *" ' « 

salinities, t«peraturL T re ' 10 " S ' 
» these W^TZ IZZZ ":**- U " iqUe « 

biochemistry or these ^ dlSti " Ct •« 

drug leads, a., "; * M " ay P "™« — « 

-Marin. Microorganisms L„ ? *" 

Marine Biotechol. Vol r pf! bl °'° 9lc » 1 resource", A dv in 

»• Environmental samp ;s",y 1 " SS : ""^ 

man-mads environments a„d Z ! °* "^ ri ' 1 ° r 

profcaryotic end eufcaryotTc organ! " " 

which may be unidentified sIsolT' a " d VlrU 
collected or 'collected fro» ' "* """o- 1 * 

35 stressed, for e*Zl. ^ologically 
' 1 el ">mple, near an industrial effl,..nt „ •, 
freshwater or seawater ni*.... serial effluent. Soil, 

cr thermal wonts IT Jit ' """^ """"^ h0t s *> ri "*° 

vents, and manna or estuarine sediments may be 
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Z^J^J^^^ - - collected 

Samples may be collect fro"^ S ° UrCeS - 
temperate and other reaLs T* ' SUbtr ° plCa1 ' 
5 thermophilic hal!nH ^ ° r 9 anisn > s »*V be 

.nethan^ie.' hal ° PhlllC < -^philic, barophilic, or 

-e po^LL 1 :; itizzi:: use r anisms that are — 

»icroor g anisms^oun d " "he ^opi:" 33 ^ ^ 

10 such habitats are beino d«* , InS ° far as 

• ^ 9 destr °yed/ species are beino lost 

that might yield useful medicines. 

Organisms with potential • , 

algae, lichens, fungi plants , Pr ° PertleS ' 

f±l fch. «*. • following characteristics: 

fiii> th» " lole ' lt 'or further experiments; and/or 

hiocheeicaUy character"^ J^T" t « 
footprinting of a cultivabl^ ™ <i " t:i,lcatl °" « ""«ic 
group o, species f™ .r • " " 

3S perform L^'He " .""T^ """^ ta 

neeae of the a™a 1 <=°»>»«><i<=y <* the eaapie ana the 
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acids F OT - J P ° extracti °" of their nucleic 

acxds. For preparing cDNAs , specific growth conditions or 
the presence of certain chemicals in the culture may be 
VTT t0 ° r enhanCe thS ^anscription of gene 

o" n u i:: s enc s c : in d g the activities ° f *» 

organxsms standard growth conditions may be used to culture 
the organisms if only genomic DNA is required. 

Sxnce it is unlikely that all donor organisms in an 
XO environmental sample may be propagated at the same rate if 
at all under laboratory conditions, some of the donor 
organxsrns may overgrow and lead to the loss or dilution of 
slow-growxno organisms. Thus , it roay be preferable ^ 
Prepare nucleic acids directly from donor organisms in an 
13 envxronmental sample without prior culturing in the 

laboratory. This may be especially useful when attempting to 
access the secondary metabolites of invertebrates such as 
»arxne sponges, where the metabolites are often believed to 
20 mxcT" 6 " ^ ^ aSS ° Ciated ^otic and uncultivable 

I d M&th0dS ^ high «™^y nucleic acids 

from donor organisms in environmental samples are provided 
below in Sections 5.1.2. ^ ° 

inclur'K : r9aniSIDS C ° ntem P lat «« * the invention may 
25 Z ^ limited t0 VirUSeS '' ^cteria; 

funl 6 Dl r / Ukary0teS ' SUCh 95 ySaStS and Pecans; algae; 

T'' tUniCateS '' ^yo-ans; worms; echinoderms; 
xnsects; mollusks; fishes; amphibians; reptiles; birds • and 

IHabies — °* — are listed 



30 



35 
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Table I: 

""if°M? X 5pi a Si^ * on °r organisms (Berdy 

1989, in "Microbial pJoducJf: m 9 "i°" 6; Goodfellow et al. 
University Press 343 - * W A PP roac ^s», Cambridge 



10 



15 



Bacteria 

Actinomycetales 



Eubacteriales 



Pseudomonadal < 



Fungi 



Mycopl asma tales 
Myxobacterial es 



20 



25 



Myxothal 1 o phytes 
Phycomycetes 
Ascomycetes 
Basi di omyce t es 
Fungi Imperfecti 



Yeasts 



Streptomyces, Micromonospora, 

Norcadia, Actinomadura, 

Actinoplanes , 

Streptosporangium, 

Microbispora, Kltasatosporla 

Azobacterium, Rhlzobium, 

Achromobacterium, 
Enterobacterium, Brucella 
Micrococcus, Lactobacillus, 
Bacillus, Clostridium, 
Brevibacteri urn 
Pseudomonas, Aerobacter, 
Vibrio, Halobacterium 
Mycoplasma 

Cytophaga, Myxococcus 

Physarum, Full go 
Mucor, Phytophtora, Rhizopus 
Aspergillus, Penicillium 
Coprinus, Phanerochaete 
Acremonium (Cephalosporium) , 
Troche-derma, Helminthosporium, 
Fusarium, Alternaria, 
Myrothecium 
S a cch aromyces 



30 



35 
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Higher forms of 



Table ii: 

exemplary donor organisms 




10 



Lichens 
Higher Plants 



15 



20 



Protozoa 

Dinoflagellates 

Insects 
Bryozoans 



Molluscs 
Sponges 



25 Corals 



Worms 



30 



35 



Annelida 

Spinunculida 
Tunicates 

Fish 



c Property ft 

SGSLSSKS' (kainio acid - 

Usnea fasclata (vulpinicaeid 
antimicrobial; usnil^ctS, ' 
antitumor) ' 

Dit h i¥ r V! thU , S (Vinca alkaloids) , 

ff»Ptotteca (Camptothecin) , 
Artemisia (artemisinin) Cole»^ 

channel agonist) 

Ptychodiscus br&vis 
(brevitoxin, cardiovascular) 
Dolomedes ("fishing spider" 

bIan m hi^f PlI ?^ a C~lS« 
bean beetle alkaloids) 

Conus toxins 

"f?f °? io "a . P rol *fera ( ectyonin 

i D ~ arab ino furanosides) 
?f eu *>fc«r©grania species 
(Pseudoteracins, anti- 

inflammatory) 

LumbriconGreis heteropa 
(nereistoxin, insecticidal) 
Boneliia viridis (bonellin 
neuroactive) Wlln » 

Inifitf^" 10 s ° li<3un (didemnin, 
anti-tumor and anti-viral) 
Ecteinascidia turblnata 

in^ ln f scidins ' anti-tumor) 
Eptatretus stoutii (eptatretin 

(proteinaceous toxins, reduce 
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Reptiles 
Birds 



Mammals 



0 
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Dendrobatid frogs 
(batrachotoxins, puroiliotoxins, 
histraonicotoxins, and other 
polyamxnes) 
Snake venom toxins 
histrionicotoxins, modified 
carotenoids, retinoids and 
steroids (Goodwin 1984 
Biochemistry of the 

2£°I£?? ld 5" Vo1 ' 11 ' ^apinan 
168) Y ° rk ' pp ' "°" 

Orlnthorhynohus anatinus (duck- 
billed platypus venom) , 

anS^i 6 * ? anten «>i<Js, retinoids 
and steroids (Goodwin 1984 
supra, pp. 173-185; Devlin'l982 
in "Textbook of Biochemistry" 
Wiley, New York, p. 750) ' 



WO 98/17811 

PCT/US97/19958 

varierrol^^T^ 771 ^ f »» *— orgTnlsL by a 

nucleic acids that £ " !" P ? I * Mt to <**•*" nigh quality 

ana partial denaturation ! ' Sl "* le str "nd.d gaps, 

(.spaciaiiy tor ,:„:» c w r:i:„ r r n °f hi9h ° oisouiar 

gana expression libraries that * ' t0 
„ the genetic information !, a "Putative of 

guality nucleic acL th a' ""^ — - P«p,re high 
gentle, rapid ana coaolet T *"* inTOnti °n Provide 

saaple. and rapid Z , Y5iS ^ m the 

other degraded "ot r '-activation of nucleases and 

1S extraction M y oe carried T" "* 0r9 "" iS, " S - 

nucleic acids in the s," " t0 the 

- pe„omed in the ia^ory ""^ iS ° la " OT ~ 

Any nucleic acid i so i a f,'_ 
breakage of the do „o ~« efficient 

20 techniques may be used TnTlZ' °* Standard 

nitrogen, griming ^he ! res 9 *» liqUid 

disruptive agents! as /"TV' 91955 " 
enzymatic digestion. * * mechanica * shearing or 

For mixed materials such as 
2S contain high amounts of touah \' ^ 

chitin (as in *ll~JL?^^\™ CellUl °~ « 
freeze-drying may be employed * ' inStance > ' 

thus making them'more ZlZll JT" T 
lyophilized materials p re ^ ^ " *"* 
30 molecular weight materials ZlT «m»*io as well as high 
Periods (Gumey 1984 i " M JhT " nUCleiC aCidS > for ^ 

*. p3 5 - 42 , Joh n m. :;i k : r M :; h ; ds r r ecuiar bioi °-' 

in liguid nitrogen. sampled tat a^ H ^ ^ fr ° ZSn 
can be frozen in fl - BO „ 6 loose ' such as soil, 

,s can be carried out o„ "rosea™" 

period of 24 -7 S hours 7r« a VaCU "" £ ° r a 

desiccated under vacuum ll ToZ^lTT^ ^ "° 

/o c. Additional steps may be 



WO 98/17811 

PCT/US97/19958 

required for preparation of en vironmenta 1 samples, such as 
concentration of microbial populations (Jacobson et al. 198 2 , 
Appl Env Microbiol, 58 : 24 58- 2462 ; 2hou et al. 1996, Appl Env ' 
Microbiol, 6 2 :316-3 22 - c^„>-,^ -, -, ^ , 
m . . . JZ2 ' Somervilie et al. 1989, Appl Env 

5 Microbiol, 55:548-554). W 

One principal method of the present invention, though 
certainly not the only one to be used, is modified from 
Chirgwin et al. (i 979 , Biochem 24:5294)# Sadler 
Curr Genet, 21,40.-416) and Poster (1991, Ph.D. thesis, 

10 University of California, Santa Barbara, . The method uses 
the strong chaotropic agent, guanidinium isothiocyanate, with 
2-mercaptoethanol to denature proteins and inactivate 
nucleases, followed by purification of the nucleic acid 

1B ma !ri al ^ CSSiUm Chl ° rlde 9r.di.nt centrifugation. The 

b^°n M r OV i ded hSrein di " erS ^ <*^i»'- method in that 
both DNA and RNA are extracted. Also included in the method 
of the invention is a high speed centrifugation step, and the 
optional addition of bisbenzimide dye. Depending on the 
donor organism used, additional steps may include, but are 
20 not limited to, treatment with hexadecylpyridinium chloride 
or cetyltrimethyl ammonium bromide (CTAB) to selectively 
remove polysaccharides, treatment with polyvinylpyrrolidone 
for removal of phenolics, and cellulose chromatography for 

» 17*7*1 °\ St : rCh ° ther -rbohydrates (Murray , Thompson, 

25 1980, Nuc Acid Res 8: 43 2 l- 2 5) . 

RNA isolated from donor organisms can be converted into 
complementary DNA (cDNA) using reverse transcriptase. 

Damaged nucleic acid may be difficult to clone resulting 
in loss of donor organism DNA and low numbers of clones in a 
library. The problem can be worsened if the host organism is 
permxssive for recombination and lacks effective endogenous 
DNA repair mechanisms. The present invention also provides 
that damaged DNA can be repaired in vitro prior to cloning, 
using enzymatic reactions commonly employed during second 
35 strand synthesis of complementary DNA (Sambrook et al. 1989 
in "Molecular Cloning" 2nd Edition, section 8). For example, 
DNA gaps and nicks may be repaired by the Klenow fragment of 
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reactions are well *„„„„ to aose M1M ^ ^ ^ 

DNA e«"cteaT in9 ' """""o^ 1 expression library fron 
5 av*il^r^! environeental sasples, the guanbity of 

selection of \" . """^ ™ *" ' ~"'"«««»» i" the 

"tract"! I Str " te9y - " is low after 

I«o a h , . nati ° n (<10 ° «» ' "» D1 " "e ligated 

into a high-efficiency cloning system e.g., Supercos as 
described in Section 5 .i. 3 . The m^/^ cl Z'Jlr. 
1. amplify ana are released fro. the vector by restriction 
en*yne digestion. Due to the nature „ f mm Zon* mt lTm* 
-.pies, which nay contain both prOcaryotic and eu^aryotic 
donor organises, it nay be desirable to use multiple host 

» h . P "Variable, or if the DNA has been amplified 

the DNA nay be ligated to each of a panel of vectors 

p'refe^r IZ -~ «* expression host cells. 

or more expression hosts. 



20 

5.1.3 



ffOST ORGANISMS Attn gECTQgS 
The tern, "host organism" as used herein broadly 
encompasses unicellular organise, SUC h as bacteria, and 
multicellular organise, such as plants and animals Any 
« cell type may be used, including those that have been 

lystZV" Wtr ° ° r genetica11 ^ engineered. A „y host-vector 
systems Knownin the art may be used in the present 
invention. The use of shuttle vectors that can be replicated 

30 aT m ! lntained in «»» °ne host organism is r6PllCated 

3 0 advantageous. 

private"! ° rganiSmS ^ h ° St CSllS be ° btained *~ 

as the M T ^ deP ° SitS ' PUbliC CUltUre -lections such 
as the American Type Culture coii on f 4<( 

suppliers q ueh k * Collection, or from commercial 

35 modi^T: organisms or cells may be further 

35 modified by techniques known in the art for specific uses. 

host TaZ t0 " he inVenti ° n < * - Preferable that the 

host organism or host cell has been used for expression of 
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host or9ani L y ^ cr^ o :„ y - r /or 

strain improvement T^ZZZtZTT 1 ^ 1 9e " etiC 

». =ona lt T :„ °: :™ s r r y be r turad — — — 

and media composition T lnCUba "° n "»«• °P«o.l density. 

phy.io 1 o 91 c.: ! :^ i i ™":r»e iB9 to the m — - 

conditions for ..in. session host. However, 

di«ere„T f r:: SHT" « * -Y - 

is library. Modified llZrl lZ ™ °* 

used to emulate s». T oonditions and media may also be 

of t h . donir o;,.:r:„s i: a r f ana 

interesting met.bol^ L fa " Ut "* Production of 
interesting compounds may be provTo^nT" 1 P ™°" °' 
». media to facilitate modlricaJZ or nUt ""°"« 
technics *„own in th. art Lv H PreCurs °"- ^ 

optimal conditions. * " PPlied *° ""•»»"■» the 

The host organism should Dref.r»hi„ j «• . 
abilities to underoo ho m , P" f orably be deficient in the 

« foreign DNA The bit ^ reCO " bi "« i <»> »"° to restrict 

codo„ 9 U8 age si„ r a r:: : h r a 9 r:rth sho r d prafe - My have * 

eu*aryotic donor organisms .re 7 " 

the host organism has th" abUity to " Pref " able 

messenger SNA properly . ! " d ° n ° r 

30 D,— ^ Periy, e.g., Bplloe out intr 

^lamentous ascomycetes. soch as »eurospora crass, and 
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those derived from Ki J£T ^ CSlls su <* as 

Preferred mammalian nos+ ™ 

are Preferred. 

those derived froin humans lnclude but not i iraited to 

* chi„ eee halter ovary ( cL ^ SUCh - 

etc (see Kriegler M. in ^„e ' NIH/3T3 ' C ° S ' 293 ' ™°, 
Laboratory Manual", New y Q ^ T ****** Session: A 

A *ost organic Z, ?^?"* & ^ 1990 > • 
processes the expressed a~n Wh±Ch aodifi ^ and 

Processing (e.g., cieavage) of p't ^ " ' and 
important for the function of \* Pr ° dUcts «Y be 

pathway. Different host Tells h Pr ° tein " 9 bioch «»-al 
specific mechanisms for the „ ° haracteri «tic and 

" ffl ° di ^-tion of Proteins. Cop^aT lati ° nal * r ° C ^ and 
^sterns can be chosen to ens!rl f h ° ellS llnSS or *** 

Processing of the foreign protein n ° dif Nation and 

*u*aryotic host cells JLh T e * PrM " d ' T ° this end, 
*°r proper and accurate proof * ^ CSllUlar machinery 

20 ^-ylation, and p^ZryllX" 1 ^ 

Preferred if the donor ° f **• ^ne product m ay be 

For example, it haH ^ m eukar y°tic 

share much of the same cor!" **** euka ^°tic fungi 

-change is possible ^^^" Ul ~ "ology. and that gene 
" Speci - (Gurr et al. 198 " ^ ° f the common fungal 

Microbes, Kinghorn ed. ' 3 St ™cture i„ EuJ caryotic 

Manipulations i n PllBgl Bennet & ^sure 1992, Gene 

example of a eu*aryot lc hi" ^ W) ' A deferred 

^ osacc , arefflyces pombe F r r r s 9 t ani t S ; 13 yeast, 
30 S. po^e ls h±gh the molecular biology of 

Purification processes anl ! * n,aj ° r CUltUre and 

Performed. second Tt IT are routinely 

-Itured, stored, and ^ZlltT^' ^ ~ ^ b * 

Third, and of particular i!^ " lab ° rato ^ netting. 

35 mixed eukaryotic DNAs It * ^ use in expressing 

expressing genes of other spe I^! f PrOPerl * s P^cing and 

mammals, studies of the ** °* fUngi ' P la "ts, and 

the spl 1Cing and processi 



WO 98/17811 

PCT/US97/19958 

heteronuclear rna (Rna which contains introns) have indicated 
that S. pambe shares with other fungi and higher metazoans a 
remarkable similarity of pattern and structure of small 
nuclear RNA (snRNA) components needed for splicing. Finally, 
5 many non-s. pombe promoters, some of which derive from 

mammalian and plant viruses, are capable of driving moderate 
to high levels of gene expression (Forsburg, 1993, Nuc Acids 
Res, 21:2955) This feature can allow the shuttling of a 
fungal DNA/cDNA library to mammalian cell expression hosts 
10 such as NIH3T3 (fibroblasts), on-7 (neuronal), or other cell 
types. 

A cloning vector or expression vector may be used to 
introduce donor dna into a host organism for expression. An 
n^ reSSi ° n C ° nStrUCt is an egression vector containing donor 
15 DNA sequences operably associated with one or more regulatory 
regions. The regulatory regions may be supplied by the donor 
DNA or the vector. a variety of vectors may be used which 
include, but are not limited to, plasmids; cosmids; 
Phagemids; artificial chromosomes, such as yeast artificial 
20 chromosomes (YACs) , and bacterial artificial chromosomes 

(BACs, Shizuya et al. 1992, Pro Natl Acad Sci 89: 8794-8797) 
or modified viruses, but the vector must be compatible with 
the host organism. Non-limiting examples of useful vectors 
are Xgtll, p WE15 , superCosl (Stratagene) , pDblet (Brun et al. 
25 1995, Gene, 164:173-177), pBluescript (Stratagene), CDM8 , 
pJB8 , pYAC3, pYAC4 (see Appendix 5 of Current Protocols in 
Molecular Biology, 1988 , M . Ausubel et ^ Greene publish 
Assoc. ft Wiley Interscience, which is incorporated herein by 
reference). An exemplary cosmid vector, pPCos+ura, for 
30 cloning and expression in Schizosaacharomyces pombe is 

provided in Figure 13, and is deposited on October 24, 1996 
at the Agricultural Research Service Culture Collection 
(NRRL) at Agricultural Research Center, U.S. Department of 
Agriculture, 1815 North University Street, Peoria, Illinois 
35 61604, United States, and is given accession number B-21637N. 
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When the regulatory regions and transcription factors of 
the host and donor organisms are compatible, donor 
transcriptional regions will be able to bind host factors 
such as rna polymerase, to effect transcription in the host 
5 organism. If the donor and host organisms are not 
compatible, regulatory regions compatible to the host 
organism may be attached to the donor DNA fragment in order 
to ensure expression of the cloned genes. 

In cases where the entire operon, including its own 
10 translation initiation codon, ribosome binding regions, and 
adjacent sequences, is inserted into the appropriate cloning 
or expression vector, no additional control signals may be 
needed. However, in cases where only a portion of the coding 
sequence of a gene is inserted, exogenous control signals, 
15 including the translation initiation codon (frequently ATG) 
and adjacent sequences, must be provided. These exogenous 
regulatory regions and initiation codons can be of a variety 
of origins, both natural and synthetic. Both constitutive 
and inducible regulatory regions may be used for expression 
20 of the donor DNA. it is desirable to use inducible promoters 
when the products of the expression library may be toxic. The 
efficiency of the expression may be enhanced by the inclusion 
of appropriate transcription enhancer elements, (see Bittner 
et al. 1987, Methods in Enzymol. 153:516-544). 
25 "Operably-associated" refers to an association in which 

the regulatory regions and the DNA sequence to be expressed 
are joined and positioned in such a way as to permit 
transcription, and ultimately, translation. The precise 
nature of the regulatory regions needed for gene expression 
30 may vary from organism to organism. Generally, a promoter is 
required which is capable of binding RNA polymerase and 
promoting the transcription of an operably-associated nucleic 
acid sequence. Such regulatory regions may include those 5'- 
non-coding sequences involved with initiation of 
35 transcription and translation, such as the TATA box, capping 
sequence, CAAT sequence, and the like. The non-coding region 
3 • to the coding sequence may also be retained or replicated 
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of a nucleic ."rLrelr " " yl " i0n SitSS - *» -fences 
associated" when th^ "* «" 

onto the same BMA tranT* .. sequence, to he transcribed 

begun in cne sequence to oe 7 "* 
stance. » po^cWonic \ T i " tC ' SeC ° nd 
Two or .ore seooeZs "ul Pt ^ "* 

10 nucleic acid serene!' " " Pr °»° ter and «y other 

transc...! * ^ a ™ operably-associated if 

is s ~dd b i e ti w " at - iy - two 

sel.cc.hl! " ' "">"" ^ "-tor „.y contain 
iso^t tenable marker genes for initially 

isolating, identifying or tracking ' 
contain donor DNA »!„ 11 9 * or 9»"i s "»a that 
but not limited ^ Ccilun T° reSiStan °* «* - 

» apramycin or gentemyc" 8r. u ' t T" 01 "' ° hl<> "»P h »^ol . 
193:179-187) sort K 1984 ' Mo1 Ge " Genet 

»anip U i,t i ::„ oT S t h r~s <H ; p r e : ai - i98s - 

inn.. Foundation UK^ cZ k Laborat °'y Manual. The John 

17,*:"„-» 2 r =an S aT M h (JOhnS ° n " ^ * —» 

antihiotic .e^on ° *" " - 

eonven^tT^d^t" c7 ^ ^ « 

, B ° restriction sites to allow sevprinr, 

e»ressi" rran,In9 P ° rti ° ne " ^ « insert. inaT * 
expression construct. 

host o rganisB , L iLeg^Ln" fthT T"" ^ " "~ 
35 chromosome. Such seoulnc^ r '"^ ^ h ° St 

*• Plication ori :~: to ::i T ude but are not iimitea 

CARS,, centromere DNA ' and tel sequences 

and te lomere DNA. it may also be 
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advantageous to inclnrto *-w 

0 *™< M include an the expression vector, host 

»ay facilitate Lte^tL D 1 th " ^ ^ ^ S " eS 

5 the host chro m oso Me ^^1111 K 6XPreS " C ° nStrUCt int ° 
Dositio nR „7 eSpec;Lall y whe « they are found in 

positions flanking the cloning site in the cloning vector 

TrlZl C °r rUCt ^ ^ inte *" ted *» no. 

il^p e:T: n a ^ s v° st organism - as a ™' - - 

xo maintained in a hU™"^ ^ * ™ « 

which Gener ! lly ' ^ ^ ^ advant ^— to use shuttle vectors 
which can be replicated and maintained in at least two host 
organisms, such as, for e v amn i 0 <. . 
ceils, bacteria ana y ZZ Tl "* ° a ™" i '" 

15 positive ana graa, negative'ba" ^ °' US ' 0r 9ra " 

invention is c.pab" " " ^o t ' • * """"^ VeCt ° r °' 
strain, k .. "Placating in aifferent speoies or 

22 I °rgsni. M , and nay contain one or more 

origins of replication that aetereine the range o, host 

"P llcatl »" « concert with cell growth. In 

repeat £ ° r . ei<a " Ple ' i£ a »— »»st range plas„id 

replication origin i s present:> the 

capable of stable inheritance in a very „iae rang. „ 

« « al ; 9 r; 9 ', the ° ri9lnS " °' *K 2 (ransegrau 

l^ oloT ^ BiCl - PBBR (Kovach et .1., 

rZlii T ^ 16:80 »- 8 ") are functional in „.„y g ram _ 

M — — — . .grobscteriuj 
dI c":^!: *" d ""«*»■ »»y °f the bacteria that harbor 

3. ITT 7 " h ° 8t ° f "plication are 
r2n7 t !. Pr0<iUCe Of interest. Origin of 
replication that is functional ■!« = n a. • , 
of relate h «- runct ^°nal in a relatively limited range 
ox related hosts can also be us*>rt = „ , . 
. . oe «sea, e.g., the replication 

35 vector „, . . 13 1.2«1-2441). Alternatively, a shuttle 

orpins eachVT nti ° n ^ *~ « — -plication 

ve«or to ne r" " 9 ' " arr °" lt I—", the 

vector to be replicatea ana maintained in the respective 
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^li/'tl^Z 1 ^ BaC i 11US ' ° ri ^ ln ° f -plication 

vector of the Jen^ ^"j^** can be used i„ a 
replication, e.g. fI L baCteri °^ origin o f 
S Present in the vecto^ T'.™ ^ 

can facilitate production^/! ° rigin ° f re P lica tion 

expression construct u " u\ fo 7 tm ° f th " 

but not limited to t™ ! various purposes, such as 

A shuttli I vector : f t T ti0n ' 
10 acting sequences derived To^T^™ ^ Con ^ ise "in- 

transmissible plasmld 1 T naturall y- occ ^ing self- 
themselves trJ^^J^."* * — r 

oy means of an i' n f^ . 

strain of bacteria to another 

15 origins of transfer ' =sguences, known as 

ana can be ~ ~» f -- »«- *> 

facilitate th. transfer of ^ " ^vention to 

different spacias or «»i„ V I*"" 

is a natural process ^ ^.T^ 1 "- C °" J -»"°" 
20 between aifferant spacias „ ! P1 * S, " 1<!s are <=«nsfarrea 
conjugation tube at fair y LTfT ° r9aniS " » 
of transfer origin • The »obi li2 ation 

• specific set If™, 311,9 ^""^ is ™ adiata <* •» 

axpression of function inr "T^ ^ ™ * "»«- * 
» Itself or in tr^ C a ^.' h r "r ^ ^ "°" — 
"80. Proc. Nati AcL « * PlaS ™ id ( ° itta « 

»«, Pl.s.ia 8 is-";- ^ Knauf at al. . 
intagratea Tr. functus, aT"'*" ~" 
American Type culture Collection ' "™ aV * llable £ ™» 

30 By using a shuttle vector wit-K 

replication origins tran!f! *PPr°priate 
-chanisms to clstruct Till "* Sel6Ction 

donor organisms £T^h ^ ° NA se * uen <~s of the 

one initLlZeLs of hlsT ^ ^ 

35 alternative speci" of h ! ° rganiSm ^ 3 ° f 

sequences can bl sta^ 0r * an W where the donor DNA 

Thus, mooiliLble ' ne "plicated and expressed, 

oilizable gene expression libraries that are 
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constructed with a shuttle Ve cf 

into multiple host organise h *"* ***** Can be 
scope of the invention. * C ° njugation are within the 

For instance, a prefers ^ 
5 vector-host organism combi ! ^ eXemplar y expression 
and the ^eriLa ™ t , " ^ C ° Said ' S ^os ! 
are commercially available frl *"~ B1US both of which 

The vector accepts through i TJTT^ "~ J ° lla ' ' 
ranging from 30-42 kbp i„ cloning site dna inserts 

10 resistance marker (neoR, a^*' ^ ° arries a neomycin 
for expression in ^"1^ ^ * 
an ampicillin resistance cen. , r * 1S ° cont ains 

cells. The coli host Select i°n in prokaryotic 

restriction systems (h sdR T T deficient certain 
XS endonuclease-deficient (end^ '^T" ^ ' is 

(recA) . The host organise 1 ' T recomb ination deficient 
carrying cytosine anl/or adenT ° NA 

Present in eukaryotic DNA and cnni e '^^ a ^^ 0n ' ^ ±S ° fte " 
dNTP analogs. ^ synthesieed using methyl- 

20 Advantages of thic ^ 

h i,hly efficient la ^l ZTtriTT ^ Utili * ati - «* 
initially generating a libJrv PaCka91ng Syste » s *~ 
»inus, coli hosts si b ^ f "friction minus , recA 

DNA may be lower than that ° f SOUrce genomic 

25 transformations, packaged V required for naked dna 
P-tected against C a g : d J o e n n ° Ml 0 C "* ^ 
cell, damagec-. inserts Ly be r " *" C ° Ji h ° St 

™* repair mechanic. Le svsT"^ * ^ Cellui ~ 
amounts of startina «™ requires only Slnall 

30 *ay not be required s\nc e le^ V " ' ^ ^ 

inserts in a certain size ral 9 °" ly acce P ts 

coli ma y be amplified to Jl initial libra ^ H» 

"ay be used in high efficient SUPerC ° iled cos »i« DNA which 
introduction into other excre« tra " Sf0rinatio " -thod. for 

The SuperCos , Tjct * * h ° St ° rganis -- 

toning in . str.J^T^ !" ~ '~ 

of replication an£ th^ neoR ol r6PlaCin9 ^ SV4 ° ° ri * in 

neoR gene wxth the Stre P to my ces origin 
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of replication (e.a «-k 

the thiostreptj resistance " " P " 101 °* P " 922) • — 
termed Streptocoe TsTlZllZl ShU " le Ve ° t0r ' 

isolation of the 4 „ L t ' 00 " st ™«ed »Y 

5 19.5. Genetic tr0 " PlJ6 " '""^ « .1. 

Manual The John t " " ""P*"**-'- * laboratory 

orioin'airtrtn 0 ~~ n : containin9 tte ™ 

KPM and Hindln This T oene by digestion with 

and cloned into supercos at'T*/' """"" " ^ 
» -it.. <s.e Bier..„ P i 99 C : T'™*' 
in addition, sequence ei.Jl " ed ejiam P 1 «> • 

cosmid mobilisatio^!. int ™*" ! -«» "r shuttle 

oont,i„in g S tr. P to»'iss-sl 0 *rr 
is into the vector ad^cent to th » Pr0 '"° ,:ere My be *■*«"»"— 

PC*, streptomyces promoL t raL ?" ° l0,,in9 ^ "* USi " 9 
can be directional!^, laments may be generated that 

supercos r r: t ^ into th * Noti/EcoRi ° f 

used including ZtlZ ZlT'T "* 

20 hrdB (Buttner. M.J. i 9 e 9 " ' HolM ^">^. « = «9) and 
"59,. Moreover, one "'J™ T ' V01, PP - 

engineered into ^re^tocos to ZuLT" 0r i i9i " ^ * 
maintenance of the vector - 

» reprrc«r s/j,cti "™ j ' eet6 ' spe ° ies - — — w. 

Chans " 19 " deFiVed fr ™ » SG5 *">» Strepcomyoe. 

ghanaensis may be used- o<;r«; k» , fi-ojnyces 

be compatible a„„ \ P SG5 ~^sed vectors have been shown to 

remT ! therefor « coexist with origin of 

replication from other y or 

3 r„ ^treptomyces plasmids, such as SCP2* 

30 £ 3 ;. P .: 01 a nis PSVH1 <MUth ^ M01 - -net ' 

-iae ranged ^^ZsZ^T^ " ' 

may be modify • *. systems - Accordingly, SuperCos i 

..Lcta^L":^ te 0 ! ^ rePli0a "°" 

-other e„empLy ^TJZlZTZZlTZ " d ' Si " d - 

- P ias. ia , factor (^^-^t.- 
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Sci 89: 8794-8797). The , 

capable of handling > 300 tlV^ PlaSmid vect °r is 

stably in E . coli * ^ J* T ™* *** **» -^ined 
PBAC is modified by the in*"V° inVention ' *he Plasmid 

5 derived from the broad ho^ °" ° f * 

replication origin SCP 2 ' / ""^ " " B ' the 

t al. 1985, oene 35:^3^^^ 

9ene. °' ' and the apramycin resistance 

For combinatorial 
*• cells as hosts, the ^rlTsiZ^T^ Usin * P^nt 

»ay be driven by any of a n w d ° n ° r C ° ding se ^oe 

Preferred strains are described" • ^ Pr -° tBrB - F ~ —pie, 
Manipulation 1985, R w o^n " Principles «* Gene 
Blackwell Scientific ' Pu b . V e^ ^ Primr ° Se 3rd 

« cloning vectors and their',, A ""^ ° f Ocular 

uneii: uses i qq d r> T _ 

Denhardt, Butterworths Pub * Rodriguez, D. T. 

Cloning 1988, B. Perbal t'k PraCtical ^uide to molecular 
such as the 35s pn A and 19 s ^ V S ° nS ' Viral promo te " 
1. "84, Nature aiorsn^f Pr ° m ° ters of ^aMV (Brisson et 
20 of tmv (Takamatsu et al 19 ll' Z *** CO " t protein Promoter 
used; alternatively pi a „t ^ J ' 6:307 " 311 ) ™ay be 

subumt of R uBxsco y ; c L U22 i p e TT rs such as the sraai1 

Broglie et .j . 1984 ^J*" * ^ 1984 < ^MBO J. 3:1671- 1680 ; 
P-oters, e.g., ^ ~ ™ ^ ' « -«* 
« al. a986, Mo3. Cell Biol T " hSf>17 - 3 ~ B ^rley et 

Both pl^t oll ls ' 6:559 - 565 > »ay be used, 
cells. P1 ' _ ^ ^oplasts may be used as hncf 

those of maize, J^ r ^^ *"* «>• 
carrots, peanut, pota^, Tua ar l T' t0baCC °' 
30 Arabidopsis, rape seed and D l ' SUnfl ° Wer ' W*. 

Preferred because of the H Plant proto P^sts are 

Potential to proliferate ^ ~ 11 — ^eir 

into a pi an t. 11 cultu res, and to regenerate 

35 Plant-ex^res^ ^J^^ C ° nStrUCts -»nprise 

include, but are not lifted Z which 
resistances, (e.g., resistance I T" ^ """^ ^^tio 

sxstance to kanamycin or hygromycin) or 
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^H id6 .r SiStanCe (e - g " resist — *° sulfonylurea, 
phosphxnothricm, or glyphosate, . screenable markers 
include, but are not be limited to, genes encoding B- 

s i 0 rT idase (jeffers ° n ' ^> ***** ^ ^ ReP 5 : 3B7 - 

5 405), lucxf erase (Ow et al. 1986 , science 234:856-859), and 

ITiTiJir rS9UlateS an ^° Cyani " P^-t production Goff 
et al. 1990, EMBO J 9:2517-2522). 

To introduce donor organism DNA into plant cells, the 
Agrobact eriu* tumefaciens system for transforming plants may 
10 be used. The proper design and construction of such T-dnI 
based transformation vectors are well known to those skilled 
in the art. Such transformations preferably use binary 
Agrobacterium T— DNA vectors (Bevan, 1984, Nuc. Acid Res. 

15 al ,lll B721) -' ^ ^ CO " CUltiv ^ Procedure (Horsch et 
15 al. 1985, Science 227:1229-1231). Generally, the 

tZTTT 1 ™ tranSf0rmati0 » -y-tem is used to engineer 

16 357 MIT PlantS (B6Van 6t a1 ' 1982 ' Ann ' Re - 
16.357-384; Rogers et al. 1986, Methods Ensymol. 118:627- 

641) , but it may also be used to transform as well as 

20 transfer DNA to monocotyledonous plants and plant cells. 

Van e sirr 1Steen ^ ^ 1984 ' EMB ° J 3:30 "-^41 ; Hooykass- 
Van Slogteren et al. 1984, Nature 311:763-764; Grimsley et 

Mol Bil; 325:1677 " 167 ^' Boulton et al. 1989, Plant 

in other embodiments, various alternative methods for 
introducing recombinant nucleic acid constructs into plant 
cells may also be utilized. These other methods are 

30 lllT 111 ?* USefUl Where t3rget is 3 -nocotyledonous 

me a r, Ce ^ Alternative *™ transfer and transformation 
methods include, but are not limited to, protoplast 
transformation through calcium-, polyethylene glycol (P EG )- 
or electroporation-mediated uptake of naked DNA (see 

35 lllTlT "V 1 "' 1984 ' EMB ° " 3:2717 -"' ^trykus et al. 
1985, Molec. Gen. Genet. 199:169-177; Fromm et al., 1985, 

Proc. Nat. Acad. Sci. USA 82:5824-5828; Shimamoto, 1989, 

Nature 338:274-276) and electroporation of plant tissues 
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(D'Halluin et al.. 1900 D1attf „ 

methods for plant cell ^ 4:1495-1505). Additional 

ror plant cell transformation include microinjection 

P anT™ r iated ° NA ^ <"»^~ - al.: 3 i 990 ' 
Plant Cell Reporter 9: 415-41 a\ a «o ' 
- w , , * w 418) , and macroprojectile 

gjgj 85: 4305— 4309; Ooraon-K.™ et al., 1990 , Plant Cell S " 603 . 

For general reviews of plant molecular biology 

1. ^trr S ' e ' eXMPle ' <■ weiesbach, i 988 

1^ !~ Mol - ul « "ol^. Acetic Press. HV 

Section VIli, pp. 421-463. r- • 

i ff- «63; and Grierson & Corev iori» r»i 

Molecular Biology 2d Ed „• <-°rey, i 98 8, Plant 

9y ' 2(J Ed '' Blackie, London, Ch. 7-9. 

» 0 wh^ 9n . inSeCt System ' Autographa californica nuclear 
polyhydrosis virus .Acnpv, =» i,-, , 

(acnpv) a baculovirus, is used as a vector 
15 to express donor genes in - vector 

donor nM , 9 ln s P°doptera frugiperda cells. The 

donor DNA seance may be cloned into non-essential regions 
(for example the polyhedrin gene) of the virus and placed 

oolvh 7 ° f " ACNPV Pr ° m0ter <*" •»»«*• th« 

20 u S TlTZ Pr : m T r) ' Th6Se —^-nt viruses are then 
used to infect host cells in which the inserted gene is 
expressed. (e.a. see omifK 4. , 

Smith it o r 6t al - 1983 ' J Viro1 46:584; 

Smith, U.S. Patent No. 4,215.0511. 



inducible I I ° f VSCt0rS contai ^^ constitutive or 

2S <IT * Pr ° m0ters ™y -ed with Saccharoses cerevisiae 
Pichla T^' SChiZ ° S »~*«™y^ P-b. (fission yeas" 
yeast!, T llS ' ^ ^— ^ P-^orpha (methylotropic 
bIo^' vT 9 CUrrent P ^ocols in Molecular 

so pTi ^Z^-T^ TIT. ■ 

153 on m fi a a a „, 1987 ' Acad - Press, N.Y., Vol. 

Press „ t ' 1986 ' ° NA C1 ° nin ^ Vol « «• IRL 

g"! ; *: D * C " 3; Bitt -' "87, Heterologous 

3S rS^STT " YeaSt ' Meth ° dS 10 E "^ology, Eds. Berger 
Mo^ T N ' Y " V01 ' 152 ' »■ 673 - 684 '' and The 

Strather 3 ; ! 7 °' °" '~ t myces, 1982, Eds. 

Strathern et al., Cold Spring Harbor Press, Vols. I and II 
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An exemplary vector, pDblet (Brun et al. 1995, Gene, 164:173- 
177), that can shuttle between E. aoli and S. pombe, and 
modification of the vector is described in section 5.5.7. 

The invention also provides a series of S. pombe/E.coli 
5 cosmid vectors, e.g., pPCos+ura (Figure 13) , pPCosl (Figure 
14) that contain a multiple cloning site, a ColEl origin and 
the ampicillin resistance gene respectively for replication 
and selection in E. coll, an autonomously replicating 
sequence (ARS) and the ura4 gene for maintenance and 
10 selection in S. pombe, an SV40 origin, and dual cos sites for 
in vitro packaging in X phage. The construction of pPCos+ura 
and pPCosl is described in section 5.5.7. 

In mammalian host cells, a variety of mammalian 
expression vectors are commercially available. in addition, 
15 a number of viral-based expression systems may be utilized, 
in cases where an adenovirus is used as an expression vector, 
the donor DNA sequence may be ligated to an adenovirus 
transcription/translation control complex, e.g., the late 
promoter and tripartite leader sequence. This chimeric gene 
20 may then be inserted in the adenovirus genome by in vitro or 
in vivo recombination. insertion in a non-essential region 
of the viral genome (e.g., reg ion El or E3) will result in a 
recombinant virus that is viable and capable of expressing 
heterologous products in infected hosts. (e.g., See Logan & 
25 Shenk, 1984, Proc. Natl. Acad. Sci. (USA) 81:3655-3659). The 
Epstein-Barr virus (EBV) origin (OriP) and EBNA-1 as a trans- 
acting replication factor has been used to create shuttle 
episomal cloning vectors, e.g., EBO-pCD (Spickofsky et al. 
1990, DNA Prot Eng Tech 2:14-18). Viral vectors based on 
30 retroviruses may also be used (Morgenstern et al. 1989, Ann 
Rev Neurosci, 12:47-65). Alternatively, the vaccinia 7.5K 
promoter may be used. (See, e.g., Mackett et al. 1982, Proc. 
Natl. Acad. Sci. (USA) 79:7415-7419; Mackett et al . 1984, J. 
Virol. 49:857-864; Panicali et al. 1982, Proc. Natl. Acad. 
35 Sci. 79:4927-4931). 

A number of selection systems may be used for mammalian 
cells, including but not limited to the Herpes simplex virus 
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thymidine kinase (Migler _ , t ^ ^ ^ 
^3"'" P"°«P-ori«»o,yltra„. f .r,« (Szybals*. « 

edenxne Phosphotransferase (Lo^, et al. 19. £x 
5 22.817) gen., c.n be employed in t*', hgprf or eprt Lll s 
respectxvely. Mso , antimetabolite resista „ oe ^ ^ ^ 

w„r=r=e! f e0 " On (dhfr), 

»80 ».tl ' *> -^otrexate ,„i gler , 

" ' AOM - SOi - USS "••3567; O'Hare. et .1. 198 1 

resxstance to mycophenolic acid (Mulligan « B erg. 1981) 
Proc. Natl. ac.o. Sci . USA 78!2072); 1b > - 

phoaphotrenster.se (neo, , whloh confers resistance to the 
18 ZT soT 6 °" 418 '-^-.-oarepin. et al. 1,81. J Kol . 
Bxol. iso.l); and hygromycin phosphotransferase <hyg) . which 
confers resistance to hygromycin .Santerre, et ax."^ ceL 

The present invention „so provides specific 

2. of I 10 :.!": f TT "^"^ th " — Performance 

xxbr.rle! "P"" 1 "" libraries. Wben the 

~tlb ^ " PUrp ° S ' °* generating secondary 

"ore! tOXlCitY °* tte «^-— — »>* t. -der- 

xiorarv x"" " organisms in the 

25 oraT7' '"^^ °' invention, the host 

tt. host ^ °° dlfied S ° th " ' he — -tviv.1 o, 

the host org.nxsm is less adversely affected by the 

production or compounds of intsrest. The iacreased tolerance 
can reduce the loss of boat org.nisms th.t .re producing^ 

3. stage! " " SCree " in9 " »«" - «" Paction 

introd'txorT B ° difi " ti °" - host organis. is the 
introduction, into and/or over-production of ective drug 
efflux system in the host organism. Membr.ne-associated 
35 '"^ Playe ' ™ 30r in resistance in 

cells (Nxkaxdo 1994. science 284:182-388; Balsi et al. 1994 
Biochim Biophys Acta n»7.,« M ' 
pnys Acta 1187:152-162; Gottesman et al. 1993, Ann 
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Rev Biochem 62:385). A modified host organism having an 
enhanced complement of efflux systems can actively secrete a 
broader range of potentially toxic compounds, thus reducing 
their accumulation inside the host organism. Negative 
5 feedback mechanisms, such as end-product inhibition of the 
metabolic pathway producing the compounds, may be avoided. 
Moreover, the isolation of the compounds may be made more 
efficient since the compounds of interest do not accumulate 
inside the host organisms. 
10 in bacteria, a large number of efflux systems have been 

studied which can pump out a wide variety of structurally 
unrelated molecules ranging from, for example, polyketide 
antibiotics (acrAE genes of *. coli, Ma et al. 1993, J 
Bacteriol 175:6299-6313), f luroquinolines and ethidium 
IS bromide (bmr of Bacillus subtills and nor A of Staphylococcus 
aureus, Neyfakh et al. 1993, Antimicrob Agents Chemother 
37:128-129), doxorubicin (drr of Streptomyces peucetius , to 
quaternary amines ( qacE of Klebsiella aerogenes and mvrC of 
B. coli) . see Table in for a list of non-limiting examples 
20 of efflux systems. Any such efflux systems may be used in a 
prokaryotic host organism. 

In yeast, many genes conferring pleiotropic drug 
resistance encode efflux systems, and may be useful in the 
present invention. For example, the bfrl+ gene confers 
25 brefeldin A resistance to Schizosaccharomyces pombe, and the 
CDRl gene of Candida albicans confers resistance to 
cyclohexamide and chloramphenicol (Prasad et al. 1995, Curr 
Genet 27:320-329). 

For mammalian cells, the multidrug resistance proteins 
30 which belong to the class of ATP-binding pump protein may be 
used (Juranka et al. 1989, FASEB J, 3:2583-2592; Paulusma et 
al. 1996, Science 271:1126-1128; Zaman et al. 1994, Proc. 
Natl Acad Sci, 91:8822-8826; Breuninger et al. 1995, Cancer 
Res 55:5342-5347, Koepsell EP 0699753). The human mdrl 
35 multiple drug resistance gene has been functionally expressed 
in Saccharomyces cerevisiae (Kuchler et al. 1992, Proc Natl 
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Table III: 

List of compounds that are secreted by 
active drug efflux systems 



chemical class 



specifi c name 



efflux systems 



10 



cationic dyes 



basic antibiotics 



hydrophilic antibiotics 



rhomadamine-66 
ethidium bromide 

acriflavine 

puromycin 
doxorubicin 

novobiocin 
macrolide 



15 



hydrophobic antibiotics beta-lactams 



20 



organic cation 

uncharged 

weak acid 

zwitterions 
detergent 



tetraphenyl 
phosphonium 

taxol 

chloramphenicol 

nalidixic acid 
mithramycin 

f luor oquinol i nes 

SDS 



bmr 



acrAE 
bmr 

drr , mdr 
acrAE 



bmr 

mdr 
bmr 

emr 
mdr 

bmr 

acrAE 



25 



30 



35 
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One or more efflux systems may be introduced, induced or 
overproduced into a host organism. The genes encoding 
components of an efflux system may be introduced into in a 
host organism and expressed using the expression vectors and 
5 technxgues described above. m some instances, it may be 
advantageous to use an inducible promoter for expression of 
the efflux system genes. 

o 5 ' 1 * 4 - g?ggg^g RIAL NATURAL PATH WAY EXPRESSION 

The present invention relates to the construction and 
uses of combinatorial gene expression libraries, wherein the 
host organise contain genetic material encoding natural 
bxochemical pathways or portions thereof that is derived from 
5 a plurality of species of donor organisms, and are capable of 
producing functional gene products of the donor organisms. 
Bxochemical pathways or portions thereof of the donor 
organisms are thus functionally reconstituted in individual 
host organisms of a library. Novel activities and compounds 
, of such biochemical pathways may be more accessible to 
screening by traditional drug discovery techniques or by 
methods provided herein. 

Either DNA or RNA may be used as starting genetic 
inaterxal for preparing such libraries which may include cDNA 
lxbraries, genomic DNA libraries, as well as mixed 
cDNA/genomic DNA libraries. DNA fragments derived from a 
plurality of donor organisms, e.g., organisms described in 
Sectxon 5.1.1, are introduced into a pool of host organisms, 
such that each host organism in the pool contains a DNA 
fragment derived from one of the donor organisms. 

It may be advantageous if the host organism and the 
donor organisms share certain genetic features, such as 
sxmxlar GC content of DNA and common RNA splicing mechanisms, 
or physiological features, such as optimal growth 
temperature. It may thus be desirable to use a host organism 
that xs phylogenetically closely related to the donor 
organisms. For instance, a prokaryotic host organism may be 
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more desirable for cloning and expression of operons of other 
pr okaryotes . 

Donor organisms that are not amenable to traditional 
drug discovery or drug development technologies may be 
5 preferred. For example, most marine bacteria are poorly 
characterized and not amenable to conventional terrestrial 
microbiology protocols. The present invention can simplify 
the development of production and purification processes. 
The fragment of donor DNA that is transferred may 

10 comprise coding regions encoding functional proteins of a 
complete biochemical pathway or portions thereof, as well as 
natively associated regulatory regions such as promoters and 
terminators. Optimal results may be obtained by using large 
prokaryotic genomic DNA fragments which have a greater 

15 probability of encoding an entire biochemical pathway. If 
the native function and organization of the transferred DNA 
fragment is maintained in the host organism, the genes of the 
donor organism may be coordinately expressed. Also provided 
are exogenous regulatory regions that may be attached to the 
20 DNA fragments so as to ensure transcription of the 

transferred genes in the host organism, thereby replacing or 
supplementing transcription initiated from the native 
promoters . 

Interestingly, many of the genes derived from marine 
25 bacteria have been found to utilize the native promoters to 
express functional proteins in E. coli. Thus, genes of 
marine microorganisms may be expressed even without the need 
to use exogenous regulatory regions. An exemplary list of 
marine bacterial genes that uses its native promoter in B. 
30 coll is provided in Table IV. 



35 



- 52 



WO 98/17811 



PCI7US97A9958 



10 



15 



20 



25 



Table IV: 

in S 5.SLi arlne bacterial senes that use 



Genefsl 

kappa-carrageenase 
(cgkA) 

Na+/H+ antiporter 
(NhaA) 



phosphodiesterase 
(cpdP) 

chitinase 



Genus & Spenipg 

Al teromonas 
carrageenovora , 
gram(-) aerobe 

Vibrio 

alginolyticus 



Vibrio fischeri, 
symbiont 

Al teromonas sp. , 
Strain 0-7 



t *} bu *J 1 tin . Al teromonas sp. 

chloride resistance M-l, gram(-) rod 



dagA-complementing 



vibriolysin (nprV) 



tetracycline 
resistance 



melanin synthesis 
(melA) 

30 DNA modification 
cluster 



Al teromonas 
haloplanktis , 
gram(-) 

ViJbrio 

proteolytics , 
gram(-) 

Vibrio salmonicida, 
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In a preferred embodiment, the method of the invention 
takes advantage of the way that genes of prokaryotes, such as 
bacteria, are organized into discrete functionally-related 
gene clusters in the genome, termed operons. In these 
5 clusters, genes encoding components of a biochemical pathway 
are linked together to common regulatory sequences. 
Functionally related genes in filamentous fungi 
(Actlnomycetes) are also known to be clustered. Gene 
clusters for many bacterial and actinomycBte , and few 
10 eukaryotic fungal, biosynthetic pathways have been isolated 
and characterized. For example, twelve proteins used to 
produce the carotenoids zeaxanthin and beta-cryptoxanthin de 
novo in Erwlnla herblcola, can be activated and produced 
synchronously in the bacterium E. coll (Perry et al. 1986, J. 
15 Bacteriol, 168:607-612; Hundle et al. 1991, Photochem and 
Photobiol 54:1:89-93). m addition, prokaryotic amino acid 
biosynthetic pathways such as leucine and isoleucine 
biosynthesis, as well as glucose transfer systems are also 
contained in discrete clusters. Thus, when prokaryotes are 
20 used as donor organisms, it is likely that genes that are 
functionally related in a biosynthetic pathway would be 
isolated in one clone. 

Donor organisms having compact genomes that contain 
relatively few non-coding regions are preferred. In many 
25 aspects, the donor organisms are bacteria which have a 

relatively small genome, for example, 4400 kbp in length for 
E. coll, and 2500-3500 kbp for archaebacteria. The number of 
independent clones required in a library to achieve a 99% 
probability of containing all of the sequences of the donor 
30 genomes is calculated from the following formula (Clarke et 
al. 1976, Cell 9:91-99): 



In (1-f) 

35 
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N = number of recombinant clones necessary in the 

library 

P = the probability a sequence is represented 

f = the fractional proportion of the genome in a 

5 single recombinant clone 

For exaieple, E. coll has approximately 4400 kbp of DNA; 
a cosmid vector can package approximately 40 kbp of DNA. 
Following these calculations, the entire genome of E» coll 
can be expected to be thoroughly represented in as few as 504 

10 clones in a cosmid library. Since a typical DNA library can 
contain 500,000 independent recombinant clones, one such 
library can effectively represent the genomes of up to 1,000 
different bacterial species having a genome size similar to 
E. coll. Thus, considerable chemical diversity can be 

15 generated and assessed efficiently by screening a gene 

expression library comprising the diverse genetic material of 
1,000 or more species of bacteria. 

The procedures described in standard treatises, e.gr., 
Maniatis et al. 1989, Molecular Cloning, 2nd Edition, Cold 

20 Spring Harbor Press, New York; and Ausubel et al., Current 
Protocols in Molecular Biology, Greene Publishing Associates 
and Wiley Interscience, New York, may be followed to carry 
out routine molecular biology reactions used in constructing 
the combinatorial gene expression libraries . 

25 A cloning strategy for combinatorial natural pathway 

gene expression library is shown in Figure 3. Any cell from 
a donor organism can potentially serve as the source of 
nucleic acid for construction of a gene expression library. 
Genomic DNA, which includes chromosomal DNA as well as DNA of 

30 extrachromosomal genetic elements, such as naturally 

occurring plasmids, may be used. Alternatively, RNA of a 
donor organism may be used. RNA, preferably messenger RNA 
(mRNA) , may be extracted, purified and converted to 
complementary DNA (cDNA) by any technique known in the art. 

35 An oligo-(dT) primer or random sequence primers may be used 
for priming first strand synthesis of cDNA. DNA inserts may 
optionally be amplified by polymerase chain reaction (PCR) . 
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Genomic DNA and RNA may be extracted and purified by the 
procedure provided in Section 5.1.2 or by those that are 
known in the art. For filamentous fungi and bacteria, such 
procedures may comprise any of several techniques including 
5 a) rapid SDS/high salt lysis of protoplasts prepared from 
young mycelia grown in liquid culture and immediate 
extraction with equilibrated phenol; b) rapid lysis of 
protoplasts in guanidinium isothiocyanate followed by 
ultracentrifugation in a CsCl gradient; or c) isolation of 
10 high molecular weight DNA from protoplasts prepared in 
agarose plugs and pulsed field gel electrophoresis. For 
bacteria, an alternative procedure of lysis by 
lysozyme/ detergent, incubation with a non-specific protease, 
followed by a series of phenol/chloroform/ isoamyl alcohol 
15 extractions may be useful. 

For optimal results, large random prokaryotic genomic 
DNA fragments are preferred for the higher probability of 
containing a complete operon or substantial portions thereof. 
The genomic DNA may be cleaved at specific sites using 
20 various restriction enzymes. Random large DNA fragments 

(greater than 20 kbp) may be generated by subjecting genomic 
DNA to partial digestion with a frequent-cutting restriction 
enzyme. The amount of genomic DNA required varies depending 
on the complexity of the genome being used. Alternatively, 
25 the DNA may be physically sheared, as for example, by passage 
through a fine-bore needle, or sonication. 

Prior to insertion into a vacant expression vector, such 
DNA inserts may be separated according to size by standard 
techniques, including but not limited to, agarose gel 
30 electrophoresis, dynamic density gradient centrif ugation, and 
column chromatography • a linear 10-40% sucrose gradient is 
preferred. The insertion can be accomplished by ligating the 
DNA fragment into an expression vector which has 
complementary cohesive termini. The amounts of vector DNA 
35 and DNA inserts used in a ligation reaction is dependent on 
their relative sizes, and may be determined empirically by 
techniques known in the art. However, if the complementary 
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restriction sites used to fragment the DNA are not present in 
the expression vector, the ends of the DNA molecules may be 
enzymatically modified, as for example, to create blunt ends. 
Alternatively, any site desired may be produced by ligating 
5 nucleotide sequences i.e., linkers or adaptors, onto the DNA 
termini; these ligated linkers or adaptors may comprise 
specific chemically-synthesized oligonucleotides encoding 
restriction endonuclease recognition sequences. in an 
alternative method, the cleaved expression vector and DNA 
10 inserts may be modified by homopolymeric tailing. 

After ligation of vector DNA to DNA inserts, the 
expression constructs are introduced into the host organisms. 
A variety of methods may be used, which include but are not 
limited to, transformation, transf ection, infection, 
15 conjugation, protoplast fusion, liposome-mediated transfer, 
electroporation, microinjection and microprojectile 
bombardment. in specific embodiments, the introduction of 
bacteriophage or cosmid DNA into an E. coli host is carried 
out by in vitro packaging the DNA into bacteriophage 
20 particles then allowing these particles to infect E. coli 
cells. other naturally-occurring mechanisms of DNA transfer 
between microorganisms may also be used, e.g., bacterial 
conjugation. 

After the host cells containing expression constructs 
25 are pooled to form a library, they can be amplified and/or 
replicated by techniques known in the art. The purpose of 
amplification is to provide a library that can be used many 
times. Amplification may be achieved by plating out the 
library, allowing the bacteria to grow, and harvesting the 
30 phage or bacteria for storage. 

Alternatively, the library may be stored in an ordered 
array. The bulk of the library can be plated out at low 
density to allow formation of single, discrete plaques or 
colonies, followed by transfer of individual plaques or 
35 colonies into the wells of coded multi-well master plates, 
e.g., 96-wei:: plates or 384-well plates. The individual 
clones are allowed to grow in the wells under the appropriate 
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conditions. The coded master plates can be used as an 
archival source to replicate each clone separately into one 
or more working plates. Thus, each clone in the library may 
be handled and assayed individually. The coded archival 
5 plates may be sealed and stored for future use. Replication 
and transfer of the clones may be done with a multi-pin 
replicator, or multi-channel devices for fluid handling. 
Preferably, all or most of the transfers and manipulations 
are performed by laboratory robots (Bentley et al. 1992, 
10 Genomics 12:534-541). 

The libraries of the invention may be preserved by 
lyophilization, or cryopreservation in a freezer (at -20°C to 
-100°C) or ur.der liquid nitrogen (-176«C to -196»C) . 

Host organisms containing donor DNA in a library may be 
15 identified ard selected by a variety of methods depending on 
the host-vector system used. In one approach, such host 
organisms are identified and selected upon the presence or 
absence of marker gene functions, e.g., thymidine kinase 
activity, resistance to antibiotics, such as kanamycin, 
20 ampicillin, bleomycin, or thiostrepton , production of 

pigment, such as melanin, and resistance to methotrexate. 
Alternatively, a change in phenotype or metabolism of the 
host organism, indicated by metabolic testing, foci formation 
in tissue culture, or occlusion body formation in baculovirus 
25 may be used. " Once selected for the presence of donor DNA, a 
series of enzymatic assays or metabolic tests may be carried 
out on the clones for further characterization. 

To characterize the donor DNA inserts in a library of 
clones containing donor DNA or a portion thereof, mini 
30 preparations •of DNA and restriction analysis may be performed 
with a representative set of clones. The results will 
provide a fingerprint of donor DNA size and restriction 
patterns that can be compared to the range and extent of 
insert DNA which is expected of the library. 

35 
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5 - 1 - 5 ' COMBINATORIAL CHT MERIC PATHWAY EXPRESSION LIBRARIES 
The present invention also relates to the construction 
and uses of combinatorial chimeric pathway expression 
libraries, wherein the host organisms contain randomly 
5 concatenated genetic materials that are derived from one or 
more species of donor organisms, and are capable of producing 
functional gene products of the donor organisms. A 
substantial number of host organisms in the library may 
contain a random and unique combination of genes derived from 
10 one or more species of donor organism (s) . Coexpression of 
the cloned genes may be effected by their respective native 
regulatory regions or by exogenously supplied regulatory 
regions. The plurality of gene products derived from the 
different donor organisms interact in the host organism to 
15 generate novel chimeric metabolic pathways and novel 

compounds. Novel activities and compounds of such chimeric 
pathways may become more accessible to screening by 
traditional drug discovery techniques or by methods provided 
herein. 

20 While not limited to any theory of how novel pathways or 

compounds are generated in a combinatorial chimeric pathway 
gene expression library, the coexpression of functional 
heterologous genes derived from one or a plurality of species 
of donor organisms enables the gene products to interact in 

25 vivo with each other, and with elements of the host organism. 
Through such interactions, new sets of biochemical reactions 
will arise, some of which can act in concert to form a 
chimeric biochemical pathway. The heterologous gene products 
may encounter substrates, cof actors and signalling molecules 

30 that are not present in their respective donor organism. 
Such substrates, cof actors and signalling molecules may be 
supplied by the host organism, by other heterologous gene 
products that are coexpressing in the same host organism, or 
from the medium. 

35 Moreover, some of the heterologous gene products may be 

modified structurally, and compartmentalized or localized 
differently during biosynthesis in the host organism. Some 
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of the heterologous gene products may be exposed to a host 
cellular environment that is different from that of their 
respective donors. 

It is envisioned that some heterologous gene products 
5 may also act ; on the host organism and modify the host 
cellular environment. Elements of the host cellular 
environment that may affect, or be affected by, the function 
of heterologous gene products may include but are not limited 
to concentrations of salts, trace elements, nutrients, 
10 oxygen, metabolites, energy sources, redox states, and pH. 
Some heterologous gene products may also interact with host 
gene products which can result in the modification of the 
host's metabolic pathways. 

Depending on the combination of heterologous genes, 
15 novel chimeric biochemical pathways and novel classes of 
compounds that do not exist in nature may be formed in the 
host organisms of the library. In combinatorial chimeric 
pathway expression libraries, the genetic resources of the 
donor organisms are multiplied and expanded to provide a 
20 diversity of chemical structure that may not be found in 
individual organisms. The libraries so prepared may be 
screened using traditional methods or methods provided by the 
present invention. Thus, the novel pathways and compounds 
are made more accessible to drug screening. 
25 Any of the donor organisms described in Section 5.1.1 

may be used in preparing a combinatorial chimeric pathway 
expression library. Donor organisms may be selected on the 
basis of their known biological properties, or they may be a 
mixture of known and/or unidentified organisms. 
30 The combinatorial chimeric pathway expression libraries 

of the invention may be assembled according to the principles 
described in section 5.1.3. m order to allow the random 
concatenation of DNA fragments from multiple species of donor 
organisms, the procedure for library assembly may be modified 
35 by including the following steps: generation of smaller 
genomic DNA fragments, ligation with regulatory sequences 
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such as promoters and terminators to form gene cassettes, and 
concatenation of the gene cassettes. 

Insert DNAs may be complementary DNA (cDNA) derived from 
mRNA, and/or fragments of genomic DNA, or DNA from an 
5 archival or mobilizable combinatorial expression library - 

The DNA or RNA of different species of donor organisms may be 
copurified, or they may be isolated separately and then 
combined in specific proportions. The random mixing of 
insert DNAs can be done at any stage prior to insertion into 

10 the cloning or expression vector. For example, large pieces 
of DNA from an archival library can be isolated and digested 
to give smaller fragments, which are then randomly religated 
to form insert DNAs for a second combinatorial expression 
library. Other methods for generating and mixing of random 

15 fragments of DNA can also be used, for example, in vitro 
recombination can be used when the DNA fragments share some 
sequence homologies. 

Methylated nucleotides, e.g., 5-methyl-dCTP, may be used 
in cDNA synthesis to provide protection against enzymatic 

2 0 cleavage, and allow directional cloning of the cDNA inserts 
in the sense orientation relative to the promoter and 
terminator fragments. 

Random fragments of genomic DNA in the range of 2-7 kbp 
may be generated by partial digestion with a restriction 

25 enzyme, having a relatively high frequency of cutting sites, 
e.g., £au3AI. Partial digestion is monitored and confirmed 
by subjecting aliquots of the samples to agarose gel 
electrophoresis . 

Exogenous regulatory regions, such as constitutive or 

30 inducible promoters and terminators may be provided to drive 
expression of the cloned genes. When the host and donor 
expression systems are not compatible, it is essential to 
provide such • regulatory sequences. PCR may be used to 
generate various promoter and terminator fragments that are 

35 specific to a particular expression host, and have defined 
. restriction s;ites on their termini. Any method for 

attachment of a regulatory region to the DNA inserts may be 
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used. Treatment with the Klenow fragment and a partial set 
of nucleotides, I.e., a partial fill-in reaction, may be used 
to create insert DNA fragments which will only ligate 
specifically to promoter and terminator fragments with 
5 compatible ends. 

The present invention provides a method involving the 
use of gene cassettes which contains two copies of a 
promoter, oppositely positioned on either side of a unique 
restriction site. Any DNA inserted into this restriction 

10 site will be transcribed on both strands by the two promoters 
respectively from both sides. 

The present invention also provides an alternative 
method involving the use of gene cassettes which contain a 
promoter and a terminator positioned on either side of a DNA 

15 insert. If the procedure for directional cloning of cDNA is 
followed, the 5' ends and 3' ends of the cDNA inserts would 
have unique matching restriction sites with the 3' ends of 
the promoter fragments and the 5' ends of the terminator 
fragments respectively. 

20 Genomic DNA fragments or cDNAs bearing compatible 

restriction sites at both ends are ligated to the promoters 
and in some cases, terminator fragments, to form gene 
cassettes having a mean size of approximately 1-10 kbp. 

Concatemers comprising multiple transcription units are 

25 assembled by an approach similar to that used in peptide 

synthesis. A subset of the pool of gene cassettes is bound 
at one end to a solid phase, e.g., a magnetic bead. The 
other free end is subjected to several successive cycles of 
"de-protection" and serial ligation of the remaining pool of 

30 transcription units. The solid phase allows separation of 
the concatemers from the un ligated DNA fragments after each 
addition cycle. When concatenation is completed, the 
concatemers are released by incubation with a restriction 
enzyme, such as an intron nuclease, that cleaves a unique and 

35 very rare site adjacent to the sol id. phase to reduce the 
probability of cleaving the concatenated DNA. Concatenated 
DNA may then be inserted into a cloning vector to form 
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expression constructs which are introduced into the 
appropriate host organisms. Alternatively, the constructs 
may be transformed into an E. coli recA minus strain for 
amplification prior to introduction into the host organisms. 
5 Details of the synthesis of the promoter and terminator 

fragments, the preparation of gene cassettes, the assembly of 
the DNA inserts, and the ligation of insert and vector, are 
provided in Sections 5.4 and 5.5. 

Once the combinatprial chimeric pathway expression 

10 library is assembled, it can be stored, amplified, 

replicated, pre-screened and screened essentially in the same 
manner as described in section 5.1.3. Where the vector 
contains the appropriate replication origins, transfer 
origin(s), and/or selection mechanisms, the genetic material 

15 in the library can be transferred from one species of host 
organism to another species or strain for expression. 

5.1.6. BIASED COMBINATORIAL EXPRESSION LIBRARIES 
In another embodiment of the invention, a biased 

20 combinatorial natural or chimeric pathway expression library 
may be prepared from preselected fragments of DNA that are 
pooled together from one or more species of donor organisms. 
Instead of using only the total pooled genomic DNA or cDNA of 
the donor organism(s) , this approach will reduce the number 

25 of clones that need to be screened and increase the 

percentage of clones that will produce compounds of interest. 
The preselected fragments of DNA contain genes encoding 
partial or complete biosynthetic pathways, and may be 
preselected by hybridizing to an initial or archival DNA 

30 library a plvirality of probes prepared from known genes that 
may be related to or are involved in producing compounds of 
interest. 

The initial DNA library, preferably a cosmid or 
bacterial artificial chromosome (BAC) library, and not 
35 necessarily an expression library, may contain DNA from one 
or more species of donor organisms. For further pre- 
screening, if the initial library is an expression library, 
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DNA in the positive clones may be transferred into and 
expressed in a host for production, such as E. coll or 
Streptomyces livldans. More than one initial library may be 
pre-screened, and DNA from all the positive clones can be 
S pooled and used for making the biased combinatorial gene 
expression library. 

The initial or archival library may be amplified so that 
DNA of the donor organisms can be pre-screened in a variety 
of host organisms. in one aspect of the invention, the 
10 cloning vector or expression vector can contain the 

appropriate replication origins and/or transfer origin (s) as 
described in scetion 5.1.3, such that the entire initial or 
archival library can be transferred or mobilized into various 
compatible host organisms via conjugation. The transfer can 
15 also be effected by isolating the donor genetic materials 
from the archival library and introducing the genetic 
material into another species or strain of organism by any 
means, such as but not limited to transformation, 
transfection and electroporation. For example, once a gene 
20 expression library in Streptomyces livldans is generated, it 
can be introduced into specialized host organisms for 
expression and screening, such as s. rlmosis that produces 
oxytetracycline, or S. parlus that produces actinomycin D. 
As another example, an archival library can be constructed in 
25 E. coli, and preselected by hybridization with nucleic acid 
probes to identify genetic materials of interest. The 
preselected DNA fragments can be isolated from the archival 
library, then randomly mixed, and cloned into a mobilizable 
expression vector to form a biased combinatorial expression 
30 library. Such a library is enriched for a particular class 
of metabolic. pathway and can be transferred into different 
host organisms for expression in different biochemical and/or 
genetic backgrounds. 

The probes used for pre-screening may be derived from 
35 any cloned biosynthetic pathway, such as the polyketide 
biosynthetic loci, as these are the best characterized 
biosynthetic loci and there is considerable sequence 
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conservation between the known clusters, e.g., act J 
(actinorhodin biosynthesis - Malpartida et al. 1987 Nature 
325:818-820), whiE (spore pigment biosynthesis - Blanco et 
al. 1993 Gene 130:107-16) and eryAl (Donadio et al. 1991 
5 Science 252, 675-679). Similar principles may be applied to 
other antibiotic or secondary metabolite biosynthetic loci. 
For example the cloned peptide synthetase genes in low-GC 
gram positive bacteria, such as Bacillus (Stachelhaus et al. 
1995 Science 269: 69-72) and in high-GC gram positive 

10 bacteria, such as actinomycetes species that produce 

thiostrepton, virginiamycin, valinomycin and actinomycin, may 
have enough sequence similarities to be used as probes to 
identify new biosynthetic loci in both groups of bacteria. 
Other cloned biosynthetic pathway, such as peptide synthases 

15 and aminoglycoside synthases, can also provide probes for 
pre-screening the initial libraries. 

Alternatively, the initial DNA library may be screened 
by probes derived from DNA that encode proteins involved in 
secondary metabolism. Such probes may be prepared by 

20 subtracting non-coding DNA and DNA encoding proteins that 

relate to primary metabolism biosynthetic pathways from total 
DNA. The remaining DNA is thus biased toward coding regions 
that encode proteins involved in secondary metabolism. 
Details of the subtraction procedure are provided in Section 

25 5.3.5. 

5.2. SCREENING COMBINATO RIAL EXPRESSION LIBRARIES 
The drug discovery system of the present invention 
further encompasses novel methods for screening combinatorial 
30 expression libraries. While standard methods of screening 
expression libraries, such as antibody binding and ligand 
binding, can also be used with expression libraries of the 
present invention, the libraries can be adapted to a reporter 
regimen tailored to identify host organisms that are 
35 expressing the desirable pathways and metabolic products. 

The, methods claimed herein enables the management of 
large sample numbers with minimal handling to permit 
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efficient and high-throughput detection and isolation of 
productive clones in the library. The libraries may be pre- 
screened for a broad range of activities, for the production 
of a class of compounds or for the presence of relevant DNA 
5 sequences. The libraries may also be used directly with a 
target in both in vivo and in vitro assays. The identified 
or isolated population of cells may readily be cultured, 
expanded in numbers, and subjected to further analysis for 
the production of novel compounds. The genes encoding the 

10 metabolic pathway that lead to production of the novel 

activity or compound may be delineated by characterizing the 
genetic material that was introduced into the isolated 
clones. Information on the genes and the pathway, and the 
clones, will greatly facilitate drug optimization and 

15 production. 

As used herein, the terms "library clones" or "library 
cells" refer to host cells or organisms in a combinatorial 
gene expression library that contain at least one fragment of 
donor DNA that may encode a donor metabolic pathway or a 

20 component thereof. The term "positive clones" or "positive 
cells" refers to library clones or cells that produce a 
signal by virtue of the reporter regimen. The term 
"productive cells" or "productive clones" refers to host 
cells or organisms in the library that produce an activity or 

25 compound of interest, in distinction from the remainder "non- 
productive cells" in the library. 

The term "pre-screen" refers to a general biological or 
biochemical assay which indicates the presence of an 
activity, a compound or a gene of interest. The term 

30 "screen" refers to a specific therapy-oriented biological or 
biochemical assay which is directed to a specific disease or 
clinical condition, and employs a target. The term "target" 
refers generally to whole cells as well as macromolecules, 
such as enzymes, to which compounds under test are exposed in 

35 a screen. The use of both pre-screens and screens generally 
embodies visual detection or automated image analysis of a 
colorigenic indicator, fluorescence detection by 
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fluorescence-activated cells sorting (FACS) or the use of a 
magnetic cell sorting system (MACS) performed on a population 
of library cells in the presence of a reporter regimen. 

The methods of the invention provide alternative but not 
5 mutually exclusive approaches to generation of detectable 
signal associated with productive cells for the purpose of 
detecting and isolating these cells of interest. A reporter 
can be a molecule that enables directly or indirectly the 
generation of a detectable signal. For example, a reporter 
10 may be a light emitting molecule, or a cell surface molecule 
that may be recognized specifically by other components of 
the regimen. A reporter regimen comprises a reporter and 
compositions that enable and support signal generation by the 
reporter. The reporter regimen may include live indicator 
15 cells, or portions thereof. Components of a reporter regimen 
may be incorporated into the host organisms of the library, 
or they may be co-encapsulated with individual or pools of 
library cells, in a permeable semi-solid medium to form a 
discrete unit for screening. 
20 To facilitate detection of compounds of interest as 

described in the following text, absorptive materials such as 
neutral resins, e.g., Diaion HP20 or Amberlite XAD-8 resin, 
may be added to cultures of library cells (Lam et al. 1995, J 
Industrial Microbiol 15:453-456). since many secondary 
2S metabolites are hydrophobic molecules, the release or 

secretion of such metabolites may lead to precipitation on 
the cell exterior. Inclusion of such resins in the culture 
causes the seguestration to occur on the resin which may be 
removed from the culture for elution and screening. 
30 m one embodiment of the invention, the host organisms 

are engineered to contain a chemoresponsive construct, 
comprising a gene encoding a reporter molecule operably- 
associated with a chemoresponsive promoter that responds to 
the desired class of compounds or metabolites to be screened 
35 in the expression library. in the presence of the desirable 
activity or compound, the chemoresponsive promoter in a 
positive clone is induced to initiate transcription of the 
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operably-associated reporter gene. The positive cell is 
identified by detectable signals generated by the expression 
of the reporter gene. 

In an alternative embodiment, a physiological probe can 
5 be used which generates a signal in response to a 

physiological change in individual cells as a result of the 
presence of a desirable activity or compound. Such a probe 
may be a precursor of a reporter molecule that is converted 
directly or indirectly to the reporter molecule by an 
10 activity or compound in the biochemical pathway sought. Upon 
contact with a productive cell, the physiological probe or 
reporter precursor generates a detectable signal which 
enables identification and/or isolation of the productive 
cell. Contact may be effected by direct addition of the 
15 probe or precursor to the library cells. Alternatively, 

contact may be effected by encapsulation and diffusion of the 
probe or precursor to the library cells during screening. 

In yet another embodiment of the invention, indicator 
cells may be used to signal the production of a desirable 
20 activity or compound, thereby enabling identification and/or 
isolation of productive cells in the library. Whole live or 
fixed indicator cells, or cellular fractions thereof may be 
mixed or co-encapsulated with individual or pools of library 
cells. Indicator cells are selected for their biological 
25 properties which is responsive to the presence of the 

desirable activity or compound. Indicator cells may be the 
target cells of the desirable compound. Alternatively, 
indicator cells may be used in conjunction with a reporter to 
generate a detectable signal. 
30 Pre-screens and screens for each library are chosen 

after comprehensive characterization of the host organism 
and, whenever possible, of the donor organisms. Assays in 
which the host organisms are positive are disqualified, while 
assays in which the donor organisms are positive are 
35 considered acceptable library pre-screens or screens. 

Substrates are preferably the targets of enzymes relevant to 
desirable biosynthetic capabilities, may be used to 
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alternatively irrelevant targets (e.g., amylase, /S- 
galactosidase) that indicate the presence of transcriptional 
and translational activity for the DNA in a specific clone. 
In yet another embodiment of the invention, antibiotic 
5 resistance may be used as an indicator of production or 
potential production of interesting secondary metabolites. 
When library clones are exposed to a panel of antibiotics, 
resistance to the antibiotics may indicate the presence of a 
self-defense mechanism, such as efflux pumps which are 
XO frequently found adjacent to secondary metabolite 

biosynthetic pathways as protection against auto-toxicity . 
Such clones may not exhibit secondary metabolite production 
at the time of detection, but have increased probability of 
containing adjacent biosynthetic pathways that can be further 
15 manipulated or examined as desired. 

The present invention also provides encapsulation as an 
efficient high-throughput method for growing cells in a 
confined space, replacing the classic method of growing 
bacteria in petri dishes. Growing cells in a plate format is 
20 both labor- and materials- intensive, while encapsulated 
cells can be grown easily in a liquid culture with the 
advantage that dividing cells are kept together, and thus 
facilitating detection of interesting secondary metabolites. 
Another advantage of encapsulation is the ability to co- 
25 encapsulate components of the reporter regimen and/ or other 
indicator cells with library cells so that pre-screening or 
screening may be performed in a discrete unit. Encapsulation 
of cells can be performed easily by means of thermal or ionic 
gelation using materials such as agarose, alginate or 
30 carrageenan. 

FACS is a well-known method for separating particles 
(l-130*im in size) based on the fluorescent properties of the 
particles (Kamarch, 1987, Methods Enzymol, 151:150-165). 
FACS works on the basis of laser excitation of fluorescent 
35 moieties in the individual particles. Positive fluorescence 
results in addition of a small electrical charge to the 
particle. The change allows electromagnetic separation of 
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positive and negative particles from a mixture. Separated 
particles may be directly deposited into individual wells of 
96-well or 384-well plates. 

MACS is a well-known method for separating particles 
5 based on their ability to bind magnetic microspheres (0.5- 
100/ra diameter) (Dynal, 1995). A variety of useful 
modifications can be performed on the magnetic microspheres, 
including covalent addition of antibody which specifically 
recognizes a cell-surface antigen or hapten. Alternatively, 

10 for magnetization of encapsulated cells, a reporter regimen 
can be incorporated into host cells that generate 
magnetogenic reporter proteins, such as ferritin. In this 
case, encapsulated cells that generate a positive signal act 
as magnetic microspheres. The selected microspheres can be 

15 physically manipulated by exposure to a magnetic field. For 
example, the selected microspheres may be sequestered by 
application of a magnet to the outside of the reaction 
vessel. 

20 5.2.1. REPORTER CONSTRUCTS 

According to the present invention, the host organisms 
in the library may be engineered to contain a chemoresponsive 
reporter construct comprising a chemoresponsive promoter 
operably-associated with a reporter gene. The host organism 

25 and/or the construct may contain other genes encoding 

accessory proteins that are involved in the regulation of 
transcription from the chemoresponsive promoter or the 
production of signals. 

A chemoresponsive promoter is any double-stranded DNA 

3 0 sequence that is capable of binding an RNA polymerase and 
initiating or modulating transcription of an operably- 
associated reporter gene only in the presence of a certain 
kind of activity or a certain class of compounds. 
Preferably, the chemoresponsive promoter has no or only a 

35 negligible level of constitutive background transcriptional 
activity in the host organism in the absence of the inducing 
activity or compound. A chemoresponsive promoter that 
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respond negatively to the presence of an activity or compound 
by decreasing or ceasing transcriptional activity may also be 
used. 

Promoters useful in the present invention may include, 
5 but are not limited to, promoters for metabolic pathways, 
biodegradative pathways, cytochromes and stress response 
(Orser et al. 1995, In vitro Toxicol 8:71-85), such as heat 
shock proteins. For example, the Pm promoter of the 
Pseudomonas TOL plasmid meta-cleavage pathway and its 

10 positive regulator Xyls protein which is inducible and 

modulated by a range of benzoates and halo- or alkylaromatic 
compounds may be used (Ramos et al. 1988, FEBS Letters 
226:241-246; de Lorenzo et al. 1993, Gene 130:41-46; Ramos et 
al. 1986, Proc Natl Acad Sci 83:8467-8471; Mermod et al. 

15 1986, J. Bateriol 167:447-454). other non-limiting examples 
of chemoresponsive promoters are promoters relating to 
phosphonate utilization (Metcalf et al. 1993, J Bacteriol 
175:3430-3442), promoters sensitive to cis-cis-muconate 
(Rothmel, 19S0); promoters sensitive to antibiotics and 

20 salicylates (Cohen et al. 1993, J Bacteriol, 175:7856-7862; 
Cohen et al. 1993, J . Bacteriol, 175:1484-1492), promoters 
from the ars€«.nic and cadmium operons from Staphylococcus 
aureus (Corbisier et al, 1993, FEMS Letters 110:231-238); 
sfiA (Quillardet et al. 1982, Proc Natl Acad Sci 79:5971- 

25 5975), zwf (Orser et al., 1995, supra). 

A reporter gene encodes a reporter molecule which is 
capable of directly or indirectly generating a detectable 
signal. This includes colorigenic or magnetogenic reporters 
as well as any light-emitting reporter such as 

30 bioluminesceht, chemi luminescent or fluorescent proteins may 
be used, which includes but are not limited to the green 
fluorescent protein (GFP) of Victoria a&quoria (Chalfie et 
al. 1994, Science 263:802-805), a modified GFP with enhanced 
fluorescence (Heim et al. 1995, Nature 373:663-4), the 

35 lucif erase (luxAB gene product) of Vibrio harv&yi (Karp, 

1989, Biochim Biophys Acta 1007:84-90; Stewart et al. 1992, J 
Gen Microbiol, 138:1289-1300), and the luciferase from 
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firefly, Photinus pyralls ( De Wet et aL 1987, Mol Cell Biol 
7:725-737). Any fluorigenic or color igenic enzymes may be 
used which includes but are not limited to beta-galactosidase 
(LacZ, Nolan et al. 1988, Proc Natl Acad Sci USA 85:2603- 
5 2607), and alkaline phosphatase. Any cell surface antigen 
may be used, for example, E. coll thioredoxin-f lagellin 
fusion protein, i.e., E. coll thioredoxin (the trxA gene) 
expressed as a fusion protein with flagellin (the flic gene) 
on the surface of E. coll flagellae (Lu et al. 1995, 

10 Bio/Technology 13:3 66-372). 

An exemplary chemoresponsive reporter construct provided 
herein is pERD-20-GFP which contains the Pm promoter and the 
XylS gene of Pseudojnonas (Ramos et al. 1988, FEBS Letter 
226:241-2476) that are responsive to certain classes of 

15 benzoates, resulting in transcription and translation 
(expression) of the reporter, GFP (see Figure 6) . 

Different promoter sequences may be generated by PCR and 
attached to the coding regions of GFP or f lagellin- 
thioredoxin reporter. Genomic and plasmid DNA containing the 

2 0 promoter of interest may be purified from the relevant 
species using standard DNA purification methods, and 
resuspended in TE. Primers may be synthesized corresponding 
to the 5 • and 3 • boundaries of the promoter regions with 
additional sequences of restriction sites to facilitate 

25 subcloning. The amplification reactions are carried out in a 
thermocycler under conditions determined to be acceptable for 
the selected template and primers. The reaction products are 
separated by agarose gel electrophoresis, and subcloned using 
the TA Cloning Kit (Invitrogen, La Jolla) . The amplified 

30 promoter sequences may be recloned into a general purpose 
cloning vector in a context 5' to the GFP or flagellin- 
thioredoxin cDNA. 



5-2.2. PHYSIOL OGICAL PROBES AND REPORTER PRECURSORS 
A physiological probe as used herein is a fluorescent or 
colorigenic agent which upon contact or entry, generates a 
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signal in response to changes in physiological and/or 
metabolic parameters of a library cell or indicator cell. 

The probe can be an enzyme substrate linked to a 
fluorogenic agent. For example, a fluorogenic alkyl ether 
5 can be incubated with the cells. If the cell is producing 
polyaromatic hydrocarbons, the hydrocarbons can induce 
microsomal dealkylases, which in turn cleave the fluorogenic 
alkyl ether, yielding a fluorescent product. 

Fluorescent probes may be selected for detection of 
10 changes in the following physiological and metabolic 

parameters such as, but not limited to, those described in 
Shechter, et al. (1982, FEBS Letters 139:121-124), and 
Bronstein et al. (Anal Biochem 219:169-81). 



15 Metaboli c activity 

Decrease in 
membrane potential 



20 



30 



35 



Intracellular pH 



Increase in • 

cy tochr ome-med iated 

oxidation 



25 



Cause 

(specific example^ 

Stress, injury 
(isopropanol) 

Physiological 
changes 



Induction of 

microsomal 

dealkylases by 

polyaromatic 

hydrocarbons 

(naphthalene) 



Stain/ Substrate 
(class of chemical) 

BacLight stain 
( Semi-permeant 
nucleic acid stain) 

BCECF-AM 
(lipophilic 
acetoxymethyl ester 
of phenolic fluor) 

7-ethoxy- 

heptadecyl-coumarin 
(fluorogenic alkyl 
ether) 



5.2.3. PRE-SCR EENING AND SCREENING OF THE LIBRARY 
The combinatorial gene expression libraries of the 
invention may be pre-screened or screened by a variety of 
methods, including but not limited to, visual inspection, 
automated image analysis, hybridization to molecular beacon 
DNA probes (Tyagi et al. 1996, Nature Biotechnol, 14:303-308) 
fluorescence activated cell sorting (FACS) and magnetic cell 
sorting (MACS) . Screening may be performed on bulk cultures 
of unamplified or amplified libraries. 

In specific embodiments of the invention, individual or 
pools of library cells are encapsulated in an inert, stable 
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and porous semi-solid matrix in the form of droplets during 
pre-screening or screening. The semi-solid matrix is 
permeable to gas, liquid, as well as macromolecules, and 
permits the growth and division of encapsulated cells. 
5 Examples of suitable matrices may include but are not limited 
to agarose, alginate, and carrageenan. The encapsulated 
library cells may be cultured and tested in the droplets, and 
remain viable so that the cells may be recovered from the 
droplets for further manipulations. The matrix may 
10 optionally be exposed to substances, such as an antibiotic, 
which can select for library cells that contain a selectable 
marker. The droplets may also be exposed to nutrients to 
support the growth of library cells. The following examples 
are offered by way of illustration and are not intended to 
15 limit the invention in any manner. 

Encapsulation may be performed in one of many ways, 
producing either macrodroplets (droplets from 0.5 to 2.5 mm) 
or microdroplets (droplets from 10 to 250 /un) depending upon 
the method of detection employed during subsequent pre- 
20 screening or screening. The size and the composition of the 
droplets may be controlled during formation of the droplets. 
Preferably, each macrodroplet or microdroplet will contain 
one to five library cells. 

For example, macrodroplets may be prepared using sodium 
25 alginate as follows: sodium alginate is dissolved in 100 mL 
of sterile water at a concentration of l% using an overhead 
mixer at 2000 rpm. A volume of library cells of E. coll or 
yeast, such as Schizosaccharomyces pombe and Saccharomyces 
species; or spores for Streptorayces species; Bacillus 
30 subtilis; and filamentous fungus such as Aspergillus and 

Weurospora species; is added to the sodium alginate solution 
so that 1-5 cells are encapsulated per droplet. The mixture 
is allowed to sit for at least 30 minutes to degas, and is 
then extruded through any device that causes the formation of 
35 discrete droplets. One such device is a syringe with a 25 
gauge needle. The droplets are formed by adding the sodium 
alginate solution drop-wise into a beaker of gently stirring 
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135 mM calcium chloride solution. Droplets are allowed to 
solidify for 10 minutes, and are then transferred to a 
sterile flask where the calcium chloride solution is removed 
and replaced with a suitable growth media. Encapsulated 
5 library cells can be grown under standard conditions. 

Microdreplets may be generated by any method or device 
that produces small droplets, such as but not limited to, 
two-fluid annular atomizer, an electrostatic droplet 
generator, a vibrating orifice system, and emulsif ication. 
10 Other methods for preparing semi-solid droplets are well 
known in the art; see for example, Weaver, U.S. patent 
4,399,219. 

The following example is a protocol for producing 
microdreplets using the emulsif ication technique (Monshipouri 
15 et al. 1995, J. Microencapsulation, 12:255-262). Using an 
overhead mixer at 2000 rpm, 0.6g sodium polyphosphate and 2% 
sodium alginate are dissolved in 100 ml sterile water, and 
the alginate solution is allowed to degas for 60 minutes. An 
oil phase is prepared by mixing 300ml oil, such as canola or 
20 olive oil, with l.og purified soy bean lecithin for at least 
30 minutes. A slurry containing l.9g calcium sulphate in 10 
ml 50% glycerol is prepared by sonication for at least 15 
minutes. This slurry and a volume of library cells which 
will yield 1-5 cells per droplet are blended into the 
25 alginate solution immediately before introduction to the oil 
phase. The emulsif ication process is initiated by slowly 
transferring the alginate mixture into the oil phase and 
mixing for 10 minutes at 580 rpm. 500 ml sterile water is 
then added and the mixing allowed to continue for 5 minutes. 
30 Microdroplets can then be removed from the oil by 

centrifugation. The microdroplets are washed and resuspended 
in a suitable growth media, ready for culture under standard 
conditions if required. The size of the droplets can be 
examined by phase microscopy. For the purpose of sorting by 
35 FACS or MACS, if the droplets are outside of the desired size 
range necessary for sorting, the droplets can be size 
selected using a filter membrane of the required size limit. 
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According to the invention, components of the reporter 
regimen or the target of a drug screen may also be co- 
encapsulated in a drop with library cell(s). Whole indicator 
cells or cellular fractions containing a bioassay, enzymes, 
S or reporter molecules may be mixed with library cells 
suspended in the medium prior to formation of macro- or 
micro-droplets as previously described. Compounds of 
interest produced by the library cells may accumulate and 
diffuse within the droplet to reach the co-encapsulated 

10 indicator cells or reporter, and generate a signal. The co- 
encapsulated indicator cell may be a live target of the 
desirable compound, e.g. pathogens for anti-inf ectives, or 
tumor cells for anticancer agents. Any change in metabolic 
status of the indicator cells, such as death, or growth 

15 inhibition, constitutes a signal and may be detected within 
the droplet by a variety of methods known in the art. Such 
methods may include but are not limited to the use of 
physiological probes, such as vital stains, or measurement of 
optical properties of the drop. 

20 When the droplets are exposed to components of the 

reporter regimen, metabolites and compounds produced by the 
encapsulated library cells and the reporter components may 
diffuse through the semi-solid medium to produce a signal. 
For example, a physiological probe may be added to a batch of 

25 droplets which are then subjected to the appropriate sorting 
format. If the library cell(s) are allowed to divide within 
the drop, the progeny of the original positive cell(s) are 
kept together in a microcolony, thereby generating a stronger 
signal. It is preferable that the semi-solid medium is 

30 optically compatible with the signal generated by the 
reporter, e.g. transparent to light for a range of 
wavelengths, so that the signal can be efficiently detected. 

Macrodroplets can be sorted using a colorigenic reporter 
either by screening by eye or by using any device that allows 

35 the droplets to pass through a screening point, and which has 
the capacity to segregate positives. Microdroplets can be 
sorted using either FACS or MACS. FACS services are performed 
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by a qualified operator on any suitable machine (e.g. 
Becton-Dickinson FACStar Plus) . Particle suspension 
densities (cells or droplets) are adjusted to i x 10* 
particles /ml. in all cases, positives can be sorted directly 
5 into multi-well plates at 1 clone per well. MACS is 

performed using an mpc-m magnetic tube rack following the 
manufacturer's instructions (Dynal, 5 Delaware Drive, Lake 
Success, New York 11042). 

Encapsulated cells which are found to be positive in a 
10 pre-screen or screen can be recovered by culturing the 

droplet by placing it either on appropriate agar or liquid 
growth media or by dissolving the droplet in sodium citrate. 
After a period of culturing, the positive cells may grow out 
of the droplet. For convenience in handling and storage of 
IS droplets, the subsequent culturing may be done in multi-well 
plates . 

Pre-screened positives which have been reduced to a 
smaller population can then either be frozen and stored in 
the presence of glycerol or grown in multi-well plates. 

20 These can be used to transfer groups of clones using multi- 
pin replicators onto various types of assay plates (e.g. 
differential media, selective media, antimicrobial or 
engineered assay lawns) . Specific assays can also be 
performed within these microtiter plates and read by a 

25 standard plate reader or any other format used in current 
high-throughput screening technologies. 

For clarity of discussion, the following subsections 
describe in more detail the different embodiments of the 
30 invention involving prokaryotic and eukaryotic, donor and 
host organisms. The following embodiments are exemplary and 
are not intended to be limiting. 

5.3. PROTOCOLS FOR THE PREPARATION OF HIGH 

35 QUALITY NUCLRTP & CIDS FPOM DONOR ORGANISMS 

The availability of high quality DNA or RNA as starting 
material is important in the construction of DNA libraries 
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that are representative of the genetic information of the 
donor organisms. Methods for extracting, selecting and 
preparing high quality nucleic acids from cultures of donor 
organisms or from environmental samples are provided in this 
5 section. A method for preparing subtracted DNA probes to be 
used in pre-screening DNA libraries for the purpose of 
enriching DNA related to secondary metabolism is also 
described. 

10 5.3.1. GUANIDINIUM I SOTHIOCYANATE NUCLEIC ACID 
ISOLATION 

Lyophilized or non-lyophilized material can be disrupted 
by passage though a mechanical grinder, or alternatively by 
hand in a mortar and pestle in the presence of fine ground 

15 glass or pumice. Immediately after grinding, ground 

lyophilized material may be mixed with 10 ml of lysis buffer 
per l-2g of material. Lysis buffer is 5M guanidine 
isothiocyanate, 50 mM Hepes pH 7.6, lOmM EDTA, and 5% 
0-mercaptoethanol (or 250 mM DTT) . After mixing and 

20 incubat *on at 50 °C for 5 minutes, the solution is rendered to 
4% sarcosyl, mixed, and incubated for 5 minutes more at 50°C 
prior to centrif ugation at 8000 g. if the supernatants are 
visibly cloudy a 90-minute centrif ugation step at 27,000g may 
be used to sediment unwanted carbohydrates. Alternatively, a 

25 15,000g spin may be used to clear the lysate of unwanted 
contaminants. Following centrif ugation, the supernatant is 
made up to 1.42M CsCl (O.lSg CsCl/ml) and layered onto a 
previously-made 5.7M CsCl/TE (10 mM Tris-HCL/ 1 mM EDTA) 
solution in ultracentr if ugation tubes. Ultra-centrif ugation 

30 can be carried out at 160,000g for 18 hours, 20°C. After 
ultra-centr if ugation, a clear, jelly-like layer at the 
1.42M/5.7M Cr.Cl interface is DNA, while total cellular RNA is 
present as a clear pellet at the bottom of the tube. 

DNA from the ultra-centrifugation step can be dialyzed 

35 against TE buffer, rendered 0.1M NaCl, precipitated with 2.5 
volumes of ethanol, dried and redissolved in an appropriate 
volume of TE. If the DNA layer is white in color, it can be 
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removed and recentrif uged for 8 hours in a 
CsCl/bisbenzidimide gradient to remove remaining 
carbohydrates. The dye can be removed by 2-5 washes with 85% 
isopropanol, and the DMA dialyzed and treated as above. 
S RNA can be redissolved in resuspension buffer (5M 

guanidine iscthiocyanate, 50 mM Hepes, pH 7.6, 10 mM EDTA) , 
diluted to 1.33M guanidine isothiocyanate with a solution of 
50 mM Hepes pH 7.6, io mM EDTA. If total RNA is desired, the 
diluted RNA sample is precipitated by the addition of 2 vol 
10 of ethanol or 1 vol of isopropanol. The precipitated RNA is 
rinsed with 70% ethanol, dried, and resuspended in water or 
formamide, and stored at -70»c until used. 

5.3.2. ISOLATION OP PQT.v (A) -CONTAINING BHft 
15 Since the vast majority of eukaryotic mRNA molecules 

contain tracts of poly (adenylic) acid at the 3' end, up to 
250 bases in length, it can be purified by affinity 
chromatography using oligo-dT cellulose matrix. A wide 
variety of commercially available oligo-dT matrices may be 
20 used, including but not limited to, simple gravity columns, 
para-magnetic particles, spin and push columns. Isolated 
mRNA may be stored either dissolved in water, in formamide, 
or dried at -70 °C. 



25 5.3.3. ENRICHMENT OF N0N— RI BOSOMAL SEQUENCES FROM 
■ TOTAL RNA 

The enrichment of non-ribosomal sequences may be an 
essential step in obtaining useful RNA populations from 
difficult or uncultivable donor organisms. The 

30 fractionation of RNA on neutral sucrose gradients can be 

useful in purifying the predominant ribosomal RNAs away from 
other RNA species (R. McGookin 1984, In Methods in Molecular 
Biology Vol. 2 Nucleic Acids. Humana Press, pp. 109-112). 
Following centrifugation, the samples containing the largest 

3S amounts of ribosomal RNA can be discarded, and the remaining 
fractions dialyzed and precipitated. 
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Other methods which utilize random primers with or 
without random-tailed oligo-dT primers and PCR may be used to 
amplify low amounts of RNA in starting material. 

5 5.3.4- FILL-IN REACTION USING THE KLENOW FRAGMENT 

The use of the Klenow fragment of E. coli DNA 
polymerase, or other DNA polymerase which lacks 
exonuclease activity, to add nucleotides to the 5» cohesive 
ends is a standard technique often used to create blunt ended 

10 DNA molecules after digestion. When used without a complete 
nucleotide set, such an activity can be exploited in creating 
ligation ends that are incompatible with themselves but 
compatible to each other. 

Such a technique has been used to produce high-titer 

15 gene libraries and constructs {Hung et al. 1984, Nuc Acids 
Res 12:1863-1874; Zaborovsky et al. 1986, Gene 42:119; 
Foster, 1991, Ph.D. thesis, University of California, Santa 
Barbara; Loftus et al. 1992, Biotechniques 12:172-175.) 

The fill-in reaction can be carried out with Klenow 

20 buffer (50 mM Tris-HCl pH 7.5, 10 mM MgCl 2 , 50 mg/ml BSA, 1 mM 
dNTP), enzyme (lOU/50^1 reaction), and an incubation of 3-4 
hours at 37 °C. After the reaction, the DNA may be purified 
by a variety of methods, including but not limited to, 
affinity chromatography, ethanol precipitation, and spin- 

25 column centrif ugation . 

5.3.5. PROTOCOLS FOR PREPARATION OF SUBTRACTED DNA 
PROBES F OR PRE— SCREENING 

RNA may be isolated from young, mid log-phase cultures 

30 of onanisms with complex life cycles that have not undergone 

differentiation. This RNA pool is complementary to genes 

involved in undifferentiated growth and primary metabolism. 

The RNA is biotinylated in vitro and hybridized in excess to 

randomly sheared, gene-sized fragments of genomic DNA from 

35 the homologous or closely related heterologous species. 

Phenol extraction of this mixture results in the removal of 

genomic sequences complementary to primary metabolism RNA at 
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the interface. This process may be repeated once. The 
resulting single stranded DNA fragments are composed of the 
(+) strand of primary metabolism genes and the (+) and (-) 
strands of other genes, including secondary metabolism- 
5 related genes. This mixture of DNA is denatured, and 
rehybridized for 5-10 half c 0 ts under highly stringent 
conditions such that only related sequences can rehybridize 
to form double-stranded DNA. The remaining single-stranded 
DNA can be removed by binding to hydroxyapatite or by 
10 digestion with mung bean nuclease. The isolated double- 
stranded DNA representing non-primary metabolism related 
genes may then be labeled using random priming, and used as 
probe to pre-screen a library. 

15 5.3.6. PURIFICATION OF NUCLEIC ACIDS FROM SOIL OR 
OTHER MI XED ENVIRONMENTAL SAMPLES 

Soil samples are flash frozen in liquid nitrogen and 

stored at -7C°C until processed. Alternatively, soil samples 

are stored frozen at -20°C. Samples are either thawed on ice 

20 immediately prior to use, or freeze-dried prior to 

processing. 

Total nv.cleic acids are extracted by a number of 
protocols with minor modifications depending on the physical 
state and source of the material. Dry to semi-dry samples 

25 are frozen and processed directly; very wet samples are flash 
frozen and freeze-dried; oily samples are diluted with 
phosphate buffered saline prior to processing. Any of the 
following procedures may be adapted: Ogram et al. 1987, J. 
Microbiol. Meth. 7:57-66; Steffan et al. 1988, Appl. Environ. 

30 Microbiology, 54:137-161; Werner et al. 1992, J. of Bact. 
174 (15) :5072-5078; Zhou et al. 1996, Appl. Environmental 
Microbiol. 62(2) : 316-322. 

Briefly, 5 g samples are lysed directly by dropwise 
addition to hot guanidium isothiocyanate lysis buffer (see 

35 Section 5.3.1), and subjected to a cesium chloride 

purification. Alternatively, the samples are mixed with 13.5 
ml of DNA extraction buffer (100 mM Tris-HCl pH 8.0, lOOmM 
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EDTA , 100 mM sodium phosphate, 1 . 5 mM NaCl, 1% CTAB 
(hexadecylmethylammonium bromide) and 100 ul of 20 mg/ml 
proteinase K in 50 ml centrifuge tubes and shaken by 
horizontal shaking at 225 rpm for 30 minutes at 37 °C. After 
5 shaking, 1.5 ml of 2 0% SDS is added, and the samples 
incubated at 65 °C for 2 hours, with end-over-end shaking 
every 15-20 minutes. The supernatants are collected by 
centrifugation at 6000 x g for 10 minutes at 20 °C. The 
pellets are re-extracted 3X by adding 4-5 ml of extraction 

10 buffer and 0.5 ml of 20% SDS, vortexing for 1 minute, 
followed by a 10 minute incubation at 65 °C and re- 
centrifugation. Pooled supernatants from 3 extractions are 
extracted twice with chlorof orm-isoamyl alcohol (48:1). The 
nucleic acids are precipitated by the addition of 0.6 volumes 

15 of isopropanol followed by a one hour incubation and 
centrifugation at 16,000 x g for 20 minutes at room 
temperature. The crude nucleic acid pellets are then 
resuspended in 10 mM Tris-HCl pH 8.0, 2 mM EDTA. Further 
purification of the DNA is by DEAE chromatography if needed. 

20 Total RNA is obtained from the crude pellet by selective 
precipitation of RNA by 4M lithium acetate or acid phenol 
extraction (Ausubel et al. 1990, Greene Publishing Associates 
and Wiley Interscience, New York; Hoben et al. 1988, Appl., 
Environ, Microbiology, 54:703-71). 

25 

5.3.7. REPAIR OF DNA 
Nicked or degraded DNA samples are repaired by 
first blunting any fragmented ends with T4 DNA polymerase 
(New England Biolabs) . The DNA is treated in blunting buffer 
30 (50 mM Tris-HCl pH 7 . 8 , 10 mM MgC12 , 40 jxM dNTPs, 5 U/10 M9 
T4 DNA polymerase) for 1-2 hours at 37 °C. The DNA is ethanol 
precipitated by the addition of 1/10 volume of 3M sodium 
acetate and 2.5 volumes of 100% ethanol. 

After centrifugation and resuspension in water, the DNA 
35 sample is trciated with E . coli DNA ligase in E . coli ligase 
buffer (50 mM Tris-HCl pH 7.8, 10 mM MgC12, 10 mM DTT, 26 uM 
NAD+, and 25 uM BSA, 10U of E. coli for 1-2 hours at 16°C. 
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After treatment the DNA sample is diluted 5 fold with a 
solution of 20 mM Tris-HCl pH 8.0, 0.3M sodium acetate and 
extracted once with phenol and once with chloroform The 
addition of 2.5 volumes of ethanol to the aqueous phase 
5 precipitates the DNA. The samples are rinsed two times with 
70% ethanol and resuspended in sterile water or 10 mM Tris- 
HCl, pH 8.0, 1 mM EDTA and frozen at -70°C until used. 

5 -4. PROTOCOLS FOR P ROKARYOTIC EXPRESSION LIBRARIES 
10 The procedures for preparing natural pathway expression 

libraries and chimeric pathway expression libraries using 
prokaryotic host and donor organisms are provided in this 
section. Purified high quality DNA obtained by the 
techniques described in Sections 5.3.1 - 5.3.4 may be used in 
15 the following procedures. 

5.4.1. BACTERIAL SPECIES, STRAINS, AND CULTURE 
CONDITIONS 

Particularly good expression host organisms are 

20 restricti °n-minus, endonuclease deficient, and recombination 

deficient. For E . coli , a preferred strain is XL1-MR 

(genotype: Mcr A— , McrCB-, McrF-, Mrr-, hsdr-, endal-, recA-) . 

For Streptomyces , a preferred strain is S. lividans TK64. 

For Bacillus subtilis , preferred strains are B • subtilis 
25 PB168 trpC2; B . subtilis PB5002 sacA, degUhy; B • suJbtilis 

PB168delta trpC2 , pksdelta 75.8; B . subtilis ATCC 39320 and 

39374. 

The donor organisms are bacterial species. Some are 
selected for the ability to produce a unique compound that is 

30 detectable by current assays. Others are selected due to 
their presence in an environmental sample of potential 
interest. In some examples, marine bacteria were obtained 
from Harbor Branch Oceanographic Institute and Scripps 
Institute of Oceanography. They were generally collected 

35 from international waters more than 200 miles offshore. 
Metabolic tes.ts as well as gram testing and colony 
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morphologies were performed to the level necessary to ensure 
that the samples are taxonomically diverse. 

E. coli are grown at 37 °C when preparing library stocks, 
and at 30 °C for expression. Marine, Actinomyces and 
5 Str&ptomyces species are grown only at 30°C. 

5.4.2. PREPARA TION OF DONOR GENOMIC DMA 
From each species of bacteria, a 10mL culture is grown. 
The bacteria are pelleted by centrif ugation and resuspended 

10 in 10mM Tris, 5mm EDTA (TE) . The DNA may be purified by the 
procedures described in Section 5.1.2., or the bacterial 
pellet may be solubilized in SDS/proteinase K, extracted by 
phenol: chloroform, and precipitated with isopropanol. The 
resulting purified DNA is resuspended overnight in TE. 

15 Aliquots of each purified DNA are subjected to agarose 

gel electrophoresis to confirm integrity and to determine the 
DNA concentration. 

To prepare random large DNA fragments for the natural 
pathway expression library, 20/ug of DNA for each species is 

20 partially digested with a frequent-cutting enzyme, such as 
Sau3A, by incubating in IX enzyme buffer and 0.01-0.5 unit 
enzyme per /xg DNA for l hour at 37 °C. The amount of enzyme 
used may be determined empirically to generate the desired 
size range. The digested DNAs are pooled, phenol : chloroform 

25 extracted, and ethanol-precipitated. 100/xg of this mixture is 
used for each library that requires large native fragments of 
genomic DNA. This mixture can optionally be size- 
fractionated through sucrose gradients. Smaller fragments of 
DNA for the chimeric pathway expression library can 

30 simultaneously be selected by size fractionation. 

The digestion and size fractionation are confirmed by 
subjecting aliquots of the samples to agarose gel 
electrophoresis . 
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5.4.3. SENERAT IQN OF PRQKARYOTIC PROMOTER FRAGMENTS 
In one example , synthetic oligonucleotides are used to 
construct a fragment containing two copies of the beta- 
galactosidase promoter (lac) , one on either side of a unique 
5 BaraHl site, with each copy of lac positioned to direct 
transcription toward the centered BamHl site (Figure 4A) . 
The synthetic oligonucleotides are phosphorylated by the 
synthesizer. 400ng of each oligonucleotide is annealed by 
boiling five minutes and slow cooling over 30 minutes to 25°C 

10 before ligating 3 0 minutes at room temperature with T4 DNA 
ligase. The ligation mix is subjected to agarose gel 
electrophoresis and 2-7 kbp fragments are excised and 
purified by Gene Clean. The joined, paired, and properly- 
oriented cassettes are inserted into the Smal site of the 

15 pBSK plasmid vector by incubation for 16 hours at 15 °C with 
T4 DNA ligase in IX ligase/PEG buffer. The ligation mix is 
introduced into XL1-MR cells. Individual clones are analyzed 
by restriction enzyme analysis and may optionally be 
sequenced to confirm orientation and accuracy. 

20 The pBSK- (lac/ lac) , clones (where n is an integer from 2 

to 10) are cultured in 0.3 liter quantities and the plasmids 
purified using a plasmid preparation kit (Qiagen) . 40/ig of 
the selected and purified pBSK- (lac-lac) n is digested to 
completion with Smal in IX buffer. The digested DNA is 

25 subjected to agarose gel electrophoresis and the lac/lac 

promoter dimers are excised and purified with Gene Clean, and 
digested to completion with BaroHl in IX buffer. See Figure 
4B and 4C. The digested promoter monomers are 
phenol: chloroform extracted, ethanol precipitated, and 

30 dephosphorylated by treatment with CIAP in IX CIAP buffer. 
The dephosphorylated, digested promoters are extracted and 
precipitated as before, and resuspended in TE at a 
concentration of 20ng/jxl before storing at -20°C or further 
use. 

35 in another example, prepared promoter fragments are 

mixed with similarly-prepared linkers that do not contain 
promoter sequences, and then used in ligations with the donor 
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genomic DNA. This allows the generation of cassettes with 
only one promoter, in cases where anti-sense transcription is 
a consideration. 

5.4.4. PREPARATION OF GENE CASSETTES FOR 

COMBINATORIAL CHIMERIC PATHWAY EXPRESSION 
LIBRARIES 

In one example, BamHI-BamHI fragments of genomic DNA 

(mean size 3.5 kbp) are mixed with an excess of 

dephosphorylated promoter fragments, and then ligated. The 

molar ratio of promoters to genomic DNA fragments is 20:1. 

The resulting units (lac / genomic DNA fragment/ lac) will 

have a mean size of approximately 4 kbp. Other prokaryotic 

promoters that may be used include other E. coll promoters 

(Harley et ah, 1987, Nuc Acid Res 15:2343-2361), and 

Streptomyces promoters (Strohl 1992, Nuc Acid Res V20:96l- 

974) for use in Streptomyces species expression hosts. In 

hosts with undetermined or significant recombination ability, 

it is desirable to use a series of different promoters such 

that any clone containing several cassettes will contain 

several different promoters. 

5.4.5. PREPARATION OF SOLID SUPPORT 
Ultralink Immobilized Streptavidin beads were purchased 
from Pierce (Cat. No. 53113). 3M Emphaze Biosupport Medium 
AB1 "blank beads" was purchased from Pierce (Cat. No. 53112) . 
Similar solid supports from other vendors may be substituted 
for this procedure. 

Oligonucleotides were purchased from Life Technologies 
(Gibco-BRL) . Oligonucleotide "Bead-link-5" is 5' biotin-GCC 
GAC CAT TTA AAT CGG TTA AT 3 • . "Bead-link-3" is 5' 
phosphate-TAA CCG ATT TAA ATG GTC GGC 3 1 . When annealed, 
these oligonucleotides contain a Swal restriction 
endonuclease site (shown underlined below) . Annealed bead- 
link oligonucleotides also leave an AT overhang at the 3 ■ 
end. This overhang is shown by holding on oligonucleotide 
bead-link-5. 
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biotin-GCC GAC C AT TTA AAT CGG TTA AT 
CGG CTG G ?A AAT TTA GCC AAT 

Equiinolar amounts of each bead-link oligonucleotide are 
mixed together in an eppendorf tube. 5M NaCl is added to the 
5 tube to a final concentration of 3 00mM. The reaction is 
incubated at 60°C for 1.5 hr. Annealing was confirmed by 
agarose gel electrophoresis using non-annealed 
oligonucleotides as a control. 

To prepare blank beads, lOOmg dry beads was resuspended 

10 in lml phosphate buffered saline (PBS) . Bovine Serum Albumin 
(BSA) was added to final concentration of lmg/ml. Beads were 
rotated for 4 hrs at room temperature. Beads were pelleted 
by centrifugation and washed 3x with 1M Tris-HCl pH8.0 for 2 
hours at room temperature to block unreacted az a lactone 

15 sites. Beads were pelleted by brief centrifugation and were 
washed extensively with PBS . Blank beads were stored in PBS 
at 4°C until used. 

To bind bead-link oligonucleotide to streptavidin beads 
lO^g previously-annealed oligonucleotides were mixed with 

20 20/il Ultralink Immobilized Streptavidin beads in lx binding 
buffer (PBS, 500mM NaCl) . Beads were incubated for three 
hours at room temperature with inversion to keep the beads 
suspended. Beads are pelleted and washed 3x with lml binding 
buffer. Beads are then washed and equilibrated with lx 

25 ligation buffer (50mM Tris-HCl pH7.8, lOmM MgC12 , lOmM 

dithiothreitol, ImM ATP, 25Mg/ml BSA). Beads are stored at 
4°c until used. 

5.4.6. ASSEMBLY OF A COMBINATORIAL CHIMERIC 
30 PATHWAY EXPRESSION T.TRP&PV 

Attachment of gene cassettes to magnetic beads: The gene 

cassettes are phosphorylated using T4 polynucleotide kinase 

in IX kinase buffer. The phosphorylated fragments are 

ethanol precipitated and resuspended in TE. 1/10 of this is 

35 ligated to a mixture of two short non-phosphorylated 

synthetic linkers. The remaining 9/10 is used for a later 

procedure. Each linker will have one of two rare-cutting 
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enzymes, either Notl or Srfl. In addition, the Notl- 
containing linker is biotinylated at the time of synthesis of 
the oligonucleotides. The Notl and Srfl linkers are mixed 
with the phosphorylated transcription units in the ratio, 
5 respectively, of 100:100:1, and ligated with T4 DNA ligase in 
IX ligase/PEG buffer for 16 hours at 15°C. This mixture is 
allowed to bind to avidin-conjugated MPG magnetic beads, and 
the manufacturer's protocols are used to remove the bead - 
bound transcription units from the ligation mixture. 

10 In the mixture of ligated DNA, approximately 1/2 will 

have a biotinylated Notl linker placed at one end and a Srfl 
linker at the other end. The Notl ends will be bound to the 
beads by avidin-biotin linkages. The fragments with Notl 
linkers at both ends are not involved in further addition 

15 steps. The fragments with Srfl linkers at both ends are not 
retained in the magnetic separation step. 

Preparation of pool of DNA for addition to beadbound 
DNA: The remaining 9/10 of the phosphorylated transcription 
units are licated as above, but to the Srfl linkers only, 

20 followed by digestion to completion with Sr.fl, 

dephosphorylation, purification and ethanol precipitation. 

De-protection of bead-bound DNA: Transcription units 
bound to the beads are digested to completion with the Srfl 
enzyme in IX Sr.fl buffer. The reaction is heat- inactivated 

25 and the beads are removed by magnetic separation. 

Concatenation: The beads are then added to a ligation 
mix containing the dephosphorylated Srf 1-Srf 1 digested 
transcription units in IX ligation buffer. Ligations are 
commenced by addition of T4 DNA ligase and proceed for 60 

30 minutes, 25°C, before heat-inactivation of the ligase and 
magnetic separation of the beads. Ligations will primarily 
occur between phosphorylated bead-bound DNA and non- 
phosphorylated transcription units. The transcription units 
on the bead are phosphorylated by T4 polynucleotide kinase, 

35 heat-inactivcited, magnetically-separated, and returned to the 
ligation mixture with the addition of more T4 DNA ligase. 
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This cycle is repeated ten tiroes before cleaving the 
polymer from the beads by digestion with Notl. The cleaved 
DNA is ethanol precipitated, resuspended in TE, and viewed on 
an agarose gel to gauge the quality. and size range before 
5 insertion into the SuperCos 1 or other vector, according to 
the expression host. The concatemers are used to generate a 
prokaryotic library in the relevant expression host as 
described in Section 5.4.5. 

10 5.4.7. ASSEMBLY OF A COMBINATORIAL NATURAL PATHWAY 
EXPRESSION LIBRARY 

The expression vector for an E. coll library is 

desirably the cosmid SuperCos 1, capable of maintaining 

inserts of 30-4 2kbp in size. Insertion of the DNA fragments 

15 into SuperCos 1 and packaging with Gigapack extracts are 

performed according to the manufacturer's directions 

(Stratagene) . 

Briefly, XL1-MR host cells are infected with SuperCos 1 
phage containing the DNA library. This is performed as 

20 follows: XL1-MR cells are grown overnight in 5mL LB medium 
with 1% maltose, 10mm MgS0 4 at 300 rpm, 37°C. The overnight 
culture is diluted 1:10 and cultured 3 hours in LB/lOmM MgS0 4 
at 300 rpm, 37 °C. The culture is pelleted by centrifugation 
at 800xg and resuspended in 5mL LB. 600^1 of this suspension 

25 is incubated with SOOcfu of library packaged in phage 

particles for 3 0 minutes at ambient temperature, followed by 
a 60 minute incubation with 8 vol LB at 300 rpm, 37 °C. 

In order to amplify the expression libraries, the 
infected host cells are spread on 150mm Petri dishes with 

30 50mL LB, 50/jg/mL ampicillin. The plates are previously dried 
for 48 hours at ambient temperature. After spreading, the 
plates are allowed to incubate overnight at 37 °C. The plates 
are scraped and the colonies resuspended with 3mL 
15% glycerol, 85% LB per plate. This bacterial suspension is 

35 stored at -70 °C for further use. 

To prepare the libraries for screening individual 
clones, the infected host cells are spread on 150ram Petri 
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dishes with 50mL LB, 50mg/mL ampicillin. The plates are 
previously dried for 48 hours at ambient temperature. After 
spreading, the plates are allowed to incubate overnight at 
37°C. Resulting colonies are picked with sterile toothpicks 
5 and transferred one per well to multi-well plates. Each well 
of a 384-well plate contains 75/mL LB, 50/ig/mL ampicillin, 7% 
glycerol. The outer rows (80 wells total) are not inoculated 
but are similarly filled with medium to provide an 
evaporation barrier during subsequent incubation and 

10 freezing. These inoculated master plates are placed at 37 °C 
for 16 hours without shaking. The overnight master 384-well 
plates are used as a source plate to replicate into one or 
more working 384-well plates or Omni-Trays. The master 384- 
well plates are then sealed individually and frozen at -80°C. 

15 Replication is done with a 3 84-pin replicator. Before and 
after each use, the 384-pin replicator is dipped sequentially 
into bleach for 2 0 seconds, water for 3 0 seconds, then 
ethanol for 5 seconds before flaming. Methods of library 
assembly are dependent on the selection of vector and 

20 expression host. 

5.4.8. PRE-SCREENING OF EXPRESSION LIBRARIES 
There are three categories of pre-screens: intracellular 
differential, and selection. 

25 Briefly, the first category, intracellular pre-screening 

entails introduction of the library into a host engineered to 
contain a chemo-responsive reporter construct. The reporter 
is GFP (green fluorescent protein) or 0-galactosidase, and 
selection is done by fluorescence-activated cell sorting 

30 (FACS) or macrodroplet sorting. 

The second category, differential pre-screening, entails 
incubation oi; the library in the host with fluorescent or 
chromogenic physiological tracers, followed by FACS or 
macrodroplet sorting. 

35 The third category, selection pre-screening, entails 

incubation of the library in the host with selective agents 
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such as antibiotics , followed by FACS or macrodroplet sorting 
to identify surviving or multiplying cells. 

For all methods , cell sorting is done on bulk cultures 
of amplified libraries prior to examination of individual 
5 cultures. 

The libraries may be pre-screened by FACS or 
macrodroplet sorting. Pools of host cells containing the DNA 
libraries are cultured in one of two formats promoting either 
high or low density micro-environments. 

10 In the first format, cells of the amplified library are 

examined as individual cells. An E. coll library aliquot is 
grown for 4 hours at 30°C in 20 vol medium at 300 rpm before 
pelleting, resuspension in 1 vol sterile ddH 2 0, incubation 
with fluorescent probes (as needed) , and placement on ice for 

15 transfer to the FACS facilities. 

In the second format, aliquots of the amplified library 
are encapsulated and cultured in the presence of substrates 
or selection agents as described in Section 5.2.3 before 
transfer to the FACS or macrodroplet sorting facilities. 

20 For cultures to be examined with fluorescent tracers or 

substrates, the cultures resuspended in ddH 2 0, are stained 
before FACS following the manufacturers protocols, typically 
as follows: incubations are in the dark, at room temperature, 
for 15 minutes, followed by pelleting for 5 minutes in a 

25 i.5mL microfuge tube and resuspension in 1 vol cold ddH 2 0. 
After sorting, pools of selected 1-1000 clones or 
macrodroplets from the expression libraries are cultured in 
0.5L nutrient media. The cultured bacteria and media are 
processed for chemical analysis by extraction with 0.5L ethyl 

30 acetate. Rotary evaporation yields a crude organic extract 
of approximately 20mg-lg extract per liter culture. The 
cognate cloned DNAs are purified and re-transformed into host 
cells to confirm the localization of relevant sequences to 
the cosmid. Chemical samples generated by expression from 

35 library clones may be examined by HPLC using a series of 

columns (cationic, anionic, reverse phase) and subsequently 
by qualitative chemical analysis using NMR. 
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5.4.9. METABOLIC TESTING OF MARINE GRAM(-) /B. COLI 
LIBRARY BY PLATE REPLICATION 

Each wild-type marine species is tested prior to 
preparation of the DNA libraries to prevent redundancy and to 
5 help determine the array of metabolic tests to be done on the 
completed libraries. 

To prepare the libraries for screening individual 
clones, the infected host cells, such as E. coll XL1-MR, are 
spread on 150mm Petri dishes with 50 ml LB, 50mg/ml 

10 ampicillin. The plates are previously dried for 48 hours at 
ambient temperature. After spreading, the plates are allowed 
to incubate overnight at 37 °C. Resulting colonies are picked 
with sterile toothpicks and transferred one per well to 
384-well plates. Each well contains 75 fil LB, 50 fig/ml 

15 ampicillin, 7% glycerol. The outer rows (80 wells total) are 
not inoculated but are similarly filled with medium to 
provide an evaporation barrier during subsequent incubation 
and freezing. These inoculated master plates are placed at 
37°c for 16 hours without shaking. The overnight master 

20 384-well plates are used as a source plate to replicate into 
one or more working multi-well plates or Omni-Trays. The 
master 384-well plates are then sealed individually and 
frozen at -80°C. Replication is done with a 384-pin 
replicator. Before and after each use, the 3 84 -pin 

25 replicator is dipped sequentially into bleach for 20 seconds, 
water for 30 seconds, then ethanol for 5 seconds before 
flaming. 

Working multi-well plates or Omni-Trays are used as 
source plates to replicate the DNA libraries onto a series of 
30 differential and/or selective media (e.g. siderophore 

detection media or antimicrobial lawns) . The results are 
compiled and compared to the profiles of the wild-type marine 
bacteria used to construct the DNA library. 
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5.4.10. METABOLIC TESTING OF MARINE GRAM(-) /E. COLI 
LIBRARY BY MAPPnn RQPLET ENCAPSULATION 

Clones are encapsulated by taking sodium alginate and 
dissolving in 100 mL of sterile water at a concentration of 
1% using an overhead mixer at 2000 rpm. A volume of library 
suspension is added so as to embed 1-5 clones per droplet. 
The mixture is allowed to sit for at least 30 minutes to 
degas. The mixture is then extruded through any device that 
allows it to form individual droplets. One such example is a 
syringe with a 25 gauge needle. These are dropped into a 
gently stirring beaker of l35mM calcium chloride. Droplets 
are allowed to harden for 10 minutes and then are transferred 
to a sterile flask and the calcium chloride removed and 
replaced with LB/ Amp media and a substrate (e.g. 
x-glucosidamine) . Flasks containing the droplets are then 
shaken at 30°C overnight and examined the following morning 
for positive clones indicated by the presence of blue 
colonies. 

Droplets are placed in a single layer in a large clear 
tray and scanned by eye. Positive colonies are removed and 
placed in 96-well master plates containing LB /Amp and 50 mM 
sodium citrate pH 7.4 to dissolve the droplet, and allowed to 
grow at 37 «C overnight. These overnight master 96-well plates 
are used as a. source plate to replicate into one or more 
working multi-well plates or Omni-Trays. The master 96-well 
plates are then sealed individually and frozen at -80°C. 
Positive clones can then be either sent for specific testing 
of the products or sent through another round of 
pre-screening or screening. Further screening may be 
performed by replication which is done with a multi-pin 
replicator. Before and after each use, the multi-pin 
replicator is; dipped sequentially into bleach for 20 seconds, 
water for 30 seconds, then ethanol for 5 seconds before 
flaming. 
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5.4.11. METABOLIC TESTING OF MARINE GRAM(-) /E. COLI 

LIBRARIES BY MT CRODROPT,ET ENCAPSULATION 
Microdroplets may be generated by the following method. 
Using an overhead mixer at 2000 rpm, 0.6g sodium 
5 polyphosphate and 2% Sodium alginate are dissolved in 100 ml 
sterile water. This mixture is allowed to degas for 60 
minutes. Then 1.9g calcium sulphate is sonicated in 10 ml 
50% glycerol for at least 15 minutes. This slurry and a 
volume of the library suspension which will yield 1-5 cells 
10 per dr °Plet are blended into the alginate solution 

immediately before introduction to an oil phase (olive oil) 
which has been premixed with the addition of l.Og purified 
soy bean lecithin for at least 3 0 minutes. The 
emulsification process is initiated by slowly transferring 
15 the alginate mixture into the oil phase and mixing for 10 
minutes at 580 rpm. 500 ml sterile water is then added and 
the mixing allowed to continue for 5 minutes. Microdroplets 
can then be removed from the oil by centrifugation and washed 
and resuspenced in LB/ Amp. For the purpose of sorting by 
20 FACS, if the droplets are outside of the desired size range 
necessary for sorting, the droplets can be size selected 
using a filter membrane of the required size limit. Clones 
can then be crown 2 hours at 30°C with shaking in LB /Amp 
media containing a fluorescent substrate. 
25 Following incubation the sample is prepared for sorting 

with FACS by centrifuging, washing and resuspending in 
sterile water at a density of l x 10 6 droplets per ml. The 
size of the droplets can be examined by phase microscopy. 
FACS services are performed by a qualified operator on a 
30 Becton-Dickinson FACStar Plus and positives are sorted 

directly into multi-well plates containing LB/Amp, isolating 
positives to 1 clone per well. These plates are allowed to 
grow at 37°C until the colonies grow out of the beads (1-2 
days) . These overnight plates are used as a source plate to 
35 replicate into one or more working multi-well plates or 
Omni-Trays. The master multi-well plates are then sealed 
individually and frozen at -80°C. Positive clones can then 
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be either sent for specific testing of the products or sent 
through another round of pre-screening or screening. Further 
screening may be performed by replication which is done with 
a 96 or 384-pin replicator. Before and after each use, the 
5 replicator is dipped sequentially into bleach for 20 seconds, 
water for 30 seconds, then ethanol for 5 seconds before 
flaming. 



15 



20 



5.4.12. METABOLIC TESTING OF 
10 ACTINOMYCETES / STREPTOMYCES LIVIDANS LIBRARY 
BY PLAT E REPLICATION 

Each cultivable wild-type actlnomycete species is tested 

prior to preparation of the DNA libraries to prevent 

taxonomic redundancy, and to help determine the array of 

metabolic tests to be done on the completed libraries. To 

prepare the libraries for screening individual clones, the 

transformed host cells, Streptomyces lividans TK66, are 

spread on 150mm Petri dishes with F10A. The plates are 

previously dried for 48 hours at ambient temperature. After 

spreading, the plates are allowed to incubate overnight at 

30 °C. Selection is initiated by overlaying with 

thiostrepton. Resulting colonies are picked with sterile 

toothpicks and transferred one per well to 96-well plates. 

Each well contains F10A media. These inoculated master 

plates are placed at 30 °c for 1-4 days. The overnight master 

96-well plates are used as a source plate to replicate into 

one or more working multi-well plates or Omni-Trays. The 

master 96- well plates are then sealed individually and 

frozen at -80 °C. Replication is done with a multi-pin 

replicator. Before and after each use, the multi-pin 

replicator is dipped sequentially into bleach for 20 seconds, 

water for 30 seconds, then ethanol for 5 seconds before 

flaming. 

Working multi-well plates or Omni-Trays are used as 
source plates to replicate the DNA libraries onto a series of 
differential and/or selective media (e.g. antibiotic plates 
or antimicrobial lawns) . The results are compiled and 
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compared to the profiles of the wild-type bacteria used to 
construct the DNA library. 

5.4.13. METABOLIC TESTING OF 

5 ACTINOMYCETES/STREPTOMYCES LJVIDANS LIBRARY 
BY MACRODROPLET ENCAPSULATION 

Clones are encapsulated by the method as described in 

Section 5.4.10 for E. coli libraries. Droplets are allowed 

to harden for 10 minutes and then are transferred to a 

sterile flask and the calcium chloride removed and replaced 

10 with F10A media and a substrate (e.g. x-gal) . Flasks 

containing the droplets are then shaken at 30°C for 1 -5 days 
and examined for positive clones indicated by the presence of 
blue colonies . 

Droplets are placed in a single layer in a large clear 
tray and scanned by eye. Positive colonies are removed and 
placed in 96-well master plates containing F10A 50 mM sodium 
citrate pH 7.4 to dissolve the droplets and then grown at 
30°C for 2 days. These overnight master 96-well plates are 
used as a source plate to replicate into one or more working 

20 multi-well plates or Omni-Trays. The master 96-well plates 
are then sealed individually and frozen at -80 °C. Positive 
clones can then be either sent for specific testing of the 
products or sent through another round of pre-screening or 
screening. Further screening may be performed by replication 

25 as described above in Section 5.4.9. 

5.4.14. : PRE— SCREENING OF CLONES BY CO- ENCAPSULATION 
WITH INDICATOR CET.T.S 

Pools of library clones are titered by plating 
30 appropriate dilutions and performing colony counts. Adequate 
library cells are mixed in 1% alginate to result in 
approximately 1 cell per macrodroplet . In addition, adequate 
indicator cells are included to result in approximately 50 
target cells per droplet. Macrodroplets are produced as 
35 described in Section 5.4.10, and cultured under appropriate 
conditions for the library and indicator cells. 
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In general, S. livldans library raacrodroplets are 
cultured at 30-c in R5 or fioa, and *. call library 
macrodroplets are cultured at 30-37«c in LB or B3 . The media 
and temperature may be adjusted to accommodate the 
5 physiological needs of the indicator cells. To visualize 
effects of the library cell has on the indicator cells, the 
following reporter regimens are utilized: to detect cell 
death inclusion of neutral red or congo red; to detect cell 
viabxlity, inclusion of substrate relevant to indicator cell 
10 (e.g., X-glucopyranoside for S. faecalis) ; to detect B— 
galactosidase reporter activity in response to promoter 
activation, inclusion of 80 mg/ml x-gal in culture media. 
After isolation of positive macrodroplets as described in 

15 T n Vt°\ 5 ' 4 ' 10 ' indiCat ° r Cells eliminated by addition of 

15 antibiotics that are selective for the library cells but not 

the indicator cells. The library cells are then stored 

and/or further examined as desired. 
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PROTOCOLS FOR FTTK^YOTTC Eypp^STOM r T n P » nT ,. 
This section describes procedures that may be generally 
applied to prepare combinatorial gene expression libraries of 
eukaryotic donor organisms. The steps involved in the 
preparation of a combinatorial chimeric pathway gene 
expression library in eukaryotes are shown in Figures 5A-5F. 

Particularly good expression eukaryotic host organisms 
are stable, non-filamentous, and characterized sufficiently 
so as to be genetically manipulatable for the purposes of 
gene expression. For yeast and fungi, a preferred species is 
S pombe, which is grown at 30-c (C. Guthrie and G.R. Funk, 
30 Guide to Yeast Genetics and Molecular Biology, Methods in 
Enzymology, Vol. l 94 , Academic Press). A. thaliana and N. 
tabacum cells are preferred hosts (CP. Lichtenstein & J. 
Draper, Genetic Engineering of Plants, DNA Cloning Vol. II 
PP. 67-119). 



35 



97 



WO 98/17811 

PCT/US97/19958 

reHcn, " hiCh C ° nSiStS " -1 o^in, 

nreM w fences of no apparent function. Thus, in 

library stanLrH ^ "'"-coding DMA .e^encos fro- a 

the pr... nce " M C D ' H C ^ a"" PUri "=" i °" *» 

t0 «. tth e„ s . 1984) „" y r„ ee Tt" 5 e> Hoechst 33258 (co ° ney * 



5.5.2. 



z: ::z::™ztvzr- specmc pri - s — 

choice « prL te r , nd ^1™" ™" t — 1 »— • — 

by th. host ora,„i S „i:r l s r uence ; oan be »- 

^ U • ^ °r instance if s. oomb& -i , le! ^ 

a rr: ssion host - both nativ * rr: r 
°l::: s ' e r ™vi e pj ™ s such as th ° s * — 

B-43 2 tl' 432 sV' ' ° (F ° rSbU ^' "93 Nuc Acid Res. 

^ : y o irr 0 :- g -' chorionic — — « 

PP 217-221, r l ama ' °kayama 1990, FEBS Letters 268(1) 
" thos^ found in ^ Peters similar J 

^ 1992 o tet «-ycline system (Faryar et 

!!?' CUIr ^ 21:3 «-349, may also be used, 
avail n reaCti ° ns ^ be Performed in a commercially 
aid ^ ll" naChine USi " 9 PCR — conditions 

30 but not S2TTT ^ hi9h fidGlity thro ^t, such as 

oolvZ ' P0l ^ e « s e (Stratagene) or Vent 

polymerase (New England Biolab^ o- \ 

will use the same reaction oondL i 

be determined ^i^^T^ST"' T C ° nditi ° nS 
DM} . *^ liy b * techniques known in the art. 

35 synthesTTi e0tide ***** commercially or 

synthesized by methods well known in the art. 



- 98 



WO 98/17811 



PCT/US97/19958 



The promoter and terminator fragments generated by PCR 
may comprise restriction sites at the 5* ends. Bgl II, Xho 
I, and BamHI are used herein to illustrate the principle of 
the invention. Any restriction sites may be used as long as 
5 the site does not appear within the promoter or terminator 
gene sequences. 

To generate cloning sites compatible to cDNA or genomic 
DNA inserts, cleavage of the promoter gene fragments with Bgl 
II and Xho I will generate promoter gene fragments which have 

10 at their 5- ends a Bgl II site and an Xho I site at their 3' 
ends. Terminators are cut only with Xho I and will have only 
an Xho I site at their 5* end. 5' and 3' orientations are 
based on the expected direction of transcription across the 
promoter or terminator gene fragment. See Figure 5B. 

IS Partial fill-in reactions utilizing the large subunit of 

E. coli DNA polymerase I (Klenow fragment) and a subset of 
deoxynucleotides (in this case dCTP and dTTP) may be used to 
generate promoter and terminator fragments that are incapable 
of self -ligation by their Xho I ends. The Bgl II ends of the 

20 promoter fragments cannot be affected because of the lack of 
base-complementarity, and the BamHI end of the terminator 
fragments have no exposed 5' end for the Klenow fragment to 
utilize. 

Treatment with a phosphatase, such as calf intestine 
25 alkaline phosphatase, will prevent Bgllls self-ligations, and 
provide similar termini for ligations in both the promoter 
and terminator fragments. cDNA fragments are protected from 
digestion with NotI by incorporation of 5 '-methyl dCTP during 
first strand synthesis (Short, J.M. 1988, Nuc Acids Res 
30 16:7583-7600) . 

In an alternative embodiment of the invention, when DNA 
inserts are derived from mRNA, directional cloning may be 
applied to improve the efficiency of cloning. The cDNA 
inserts can be unidirectionally ligated in the sense 
35 orientation with respect to the promoter and terminator 
fragments. This can be achieved by generating different, 
non-ligatible ends on both promoter and terminator fragments. 
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Bgl II, Xho I, Xma I, and BamHI are used to illustrate the 
invention. Any pair of enzymes that generate compatible ends 
and can be protected by methylation can be used. 

An Xmal site is substituted for the Xho I site at the 5' 
5 ends of the terminator fragments, while the preparation of 
the promoter 'fragments is unchanged. Xma I is used because 
it is compatible with Not I by filling in with Klenow 
fragment and dCTP. This results in a terminator fragment 
that has a two-base dCTP-dCTP 5' overhang, which is 
10 compatible with suitably prepared Not I digested cDNA gene 
fragments. See Figure 5A. 

5.5.3. PREPARATION OF DNA INSERTS 
Coding gene fragments for the eukaryotic libraries will 

15 be derived from two principal DNA sources, namely that of 

genomic DNA (gDNA) or complementary DNA derived enzymatically 
from messenger RNA (cDNA) . Strategies for preparation of 
gDNA or cDNA are very similar, but not identical. 

Complementary DNA is made from messenger RNA and/ or 

20 total RNA using standard protocols available in the 

literature, or particular to a manufacturer's instructions. 
Isolation of total RNA may be accomplished simultaneously 
with genomic DNA by the guanidium-isothiocyanate method 
described in Section 5.3.1, and mRNA can be isolated by 

25 subsequent affinity chromatography over oligo-dT cellulose. 
First strand cDNA synthesis can use an oligo-dT DNA 
primer that contains a cloning site, e.gr., a Not I site, at 
the 5» end. An oligonucleotide of random sequence, which 
contains an internal Not I site near its 5' end, can also be 

30 used for randomly-primed first strand synthesis. The use of 
this alternative primer avoids 3' bias for large mRNAs. 
Methylated deoxynucleotide, such as 5-methyl-dCTP may be used 
with a polymerase such as Pfu to provide protection from 
restriction digestion (Short et al., supra; G.L. Costa, 1994, 

35 Strategies 7:8). Only non-methylated sites present in the 
initial primers will be available for cleavage, thus ensuring 
a defined 3' end for the cDNAs. Methylated cDNA can also be 
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produced by treatment with methylation, but the 
directionality of the cloning will be lost because all 
available sites will be methylated, and thus resistant to 
enzymatic cleavage, 
5 Defined 5« ends of cDNA may be prepared by ligation of 

sequence-specific adapters, such as a modified BamHl adapter 
which has a 5 • phosphate. When annealed to its partner 
oligonucleotide, the adapter contains only a two-base dGTP- 
dATP 5' overhang and a blunt 5' phosphate end. This modified 

10 adapter can be ligated to cDNA that has been treated with Pfu 
or T4 DNA polymerase as in standard protocols. After 
ligation of modified BamHl adapters and digestion of the cDNA 
with Not I , the adapted cDNA can be treated with Klenow 
fragment and dGTP generating a defined, directionally 

15 oriented cDNA gene insert ready for ligation to suitably 

prepared promoter and terminator fragments. The orientation 
of the fragments is such that the 5' end of the cDNA is 
located toward the 3 ■ end of the promoter, and the 3' end of 
the cDNA is located toward the 5 1 end of the terminator 

2 0 fragment. See Figure 5C. 

Genomic DNA fragments are obtained by partial digestion 
of total genomic DNA with a frequently cutting restriction 
enzyme, such as Sau 3AI. This enzyme is widely used for this 
purpose, and partial digestion followed by sizing though 

25 sucrose gradients is a very standard technique. Fragment 
pools from three different digestions that vary in the 
concentration of initial enzyme can be used to allow for 
differences in enzyme sensitivity within the genomes. 

Following size fractionation and purification, the 

30 fragments can be treated with BamHl methylase to protect any 
internal BamHl sites, followed by treatment with Klenow 
fragment and dATP & dGTP. This results in gene fragments 
that are internally methylated at BamHl sites, and possess 
only dATP -dGTP overhangs, see figure 5D. These fragments are 

35 incapable of self -ligation, and are only capable of ligating 
to suitably prepared promoter and terminator gene fragments. 
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5.5-4. LIGATION OF INSERT DNA TO PROMOTERS AND 
TERMINATORS 

Suitably prepared cDNA, promoter, and terminator 

fragments can be ligated at 16 °C over night. A ratio of 10 

promoter (P) : 1 cDNA: 10 terminator (T) may be used in the 

ligation reaction. The optimal ratio may be determined 

empirically by techniques known in the art. The directional 

cloning procedure provides only one ligation product, i.e., a 

correctly oriented promoter-sense insert-terminator gene 

cassette. 

Ligation of prepared genomic DNA, promoter, and 
terminator gene fragments may be carried out at 16 °C with 
varying ratios. Since none of the ligation components can 
self-ligate, the optimal ratios may be determined 
15 empirically. It is estimated that half of the ligation 
products formed are directly useable, 1/4 of the products 
formed cannot enter the rounds of ligations, and 1/4 of the 
products can be ligated only once before terminating the 
growing chain. 

The following combinations (p=promoter, frag=5 , ->3' 

20 

genomic DNA fragment, T=terminator garf=3 , -*5 l genomic DNA 
fragment) : 

1. P-frag-T 5. P-garf-T 

2. T-frag-P 6. T-garf-P 
25 3. P-frag-P 7. P-garf-P 

4. T-frag-T 8. T-garf-T 

Combinations 1,6 & 2,5 represent the desired constructs, 
but because the orientations of the inserts are random, it is 
expected that 50% of these constructs will be in the correct 

30 orientation for any given gene (1 and 6) . 

Terminator/terminator gene cassettes may form, but 
cannot be involved in any subsequent cloning step because of 
the lack of an exposed 5» end because of the blunted, uncut 
BamHI end at their 3 1 termini . 

35 Promoter /promoter constructs will clone in subsequent 

ligations only to other exposed BamHI ends, because the Bgl 
II end lacks a 5' phosphate (first round). Subsequent 
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ligations to the exposed Bgl II end should be rare with 
incoming gene cassettes because of the lack of 5' phosphates. 
Exposed BajnHI ends will only be made possible on resident 
forming chains and not on incoming new gene cassettes. Thus 

5 it is expected that such promoter /promoter gene cassettes 
will terminate a chain by circularization with a nearby BajnHI 
site on another chain, such circular izat ions are non- 
recoverable. If such promoter /promoter fragments become a 
significant problem to ligation efficiencies then an 

10 intermediate kinase treatment of the fixed growing chains 
prior to addition of new gene cassettes should allow the 
promoter/promoter fragments to extend the growing chains by 
forming Bgl II/Bgl II ligation products. The kinase 
treatment will promote Bgl II/Bgl II and Bgl II/BamHI 

15 ligations on the solid phase, which will circularize the 
growing chains involved. 

5.5.5. SERIAL LIGATIONS OF GENE CASSETTES TO FORM 

CONCATEMERS 

Ligation of the gene cassettes, each consisting of 
either genomic DNA or cDNA insert flanked by 
promoters/terminator combination will be performed in a 
method analogous to that outlined previously for prokaryotic 
DNAs . The major difference here is that this strategy used 
the endonuclease BamHI to create exposed 3 1 restriction sites 
for subsequent cloning. The use of either BamHI methylase or 
5-methyl-dCTI' insures that BajnHI sites within the insert DNA 
will be protected. See Figure 5E. 

After 5-10 rounds of chain ligation, the growing chains 
of concatemers will be deprotected with BamHI and prepared 
for ligation to the expression vector by treatment with the 
Klenow fragment and dATP and dGTP. This will render all ends 
of the growing chain incapable of ligating to each other, 
thus eliminating any circularization and loss of concatemer 
35 chains. 

Vector DNA can be ligated to concatemer chains in a 5:1 
molar ratio. Other ratios may also be used. The can be done 
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at 16-C for a-ia hours, or at 22-c for four hours. Following 
ligation the beads can be washed and resuspended in intron 
nuclease restriction buffer. Digestion will be carried out 
as described by the manufacturer's instructions. Any intron 
5 nuclease may be used. The enzyme Ceul is preferred for it 
produces non-palindromic 3' overhangs, which are useful in 
preventing self -ligations. See Figure 5F. 

5.5.6. CIRCULARI 2ATI0N AND T ^ SF0 ^"°; 

VECTOR CONTAINING CnncateiBPr CONSTRUCTS 

10 concatemer-vector molecules released from the solid 

phase can be encouraged to undergo intra-molecular ligation 

by dilution of the Ceul digestion mix 100-fold with IX ligase 

buffer. T4 ligase can be added, and the reactions may be 

carried out at 22 °C for 4-6 hours, or 16 °C overnight. See 

15 Figure 5F. The resulting constructs may be concentrated by 
microfiltration or f reeze-drying, and introduced into either 
S. pombe strains, or alternatively into E. coli or S. 
lividans strains by standard methods. Any method may be 
used, including but not limited to electroporation, and 

20 modified calcium-phosphate transformation methods. 

5 5.7. PREPARATION AND LIGATION OF PREPARED VECTOR 

FOK EXPRESSION IN YEAST . ■ 

This section describes procedures that may be generally 
25 applied to prepare combinatorial gene expression libraries 
using yeast as the host organism. 

For preparing a library in S. pombe, one possible 
vector, but certainly not the only vector, is the E. coli/S. 
pombe shuttle vector pDblet (Brun et al. 1995, Gene, 164:173- 
30 177) . This vector has the advantage of having multiple 
cloning sites and fl phage origins, being expressed at 
moderately high copy number and being very stable in both E. 

coli and S. pombe. 

For the present invention, the multiple cloning site 
35 (MCS) of pDblet may be modified to accommodate a BstXI site 
of known sequence. See Figure 6B. This is because the 



- 104 - 



WO 98/17811 PCTmS97/l9958 

intron nuclease enzyme that is used to release the concatemer 
chain from the solid phase generates 3' nucleotide overhangs 
of a defined sequence ( 3 1 GATT . . . ) • An engineered BstXI site 
having the sequence CCACCTAACTGG generates the appropriate 
5 CTAA-3 1 overhang after cleavage. 

To modify pDblet, it can be first cut with SacI & NotI 
to remove the existing BstXI site which does not have the 
correct sequence. The pDblet plasmid, once purified by spin- 
chromatography or other means, can be mixed with a 
10 presynthesized oligonucleotide which contains in addition to 
a correct sequence for the BstXI site, a new NcoX site and 
SacI- and NotI- compatible overhangs. See Figure 6C. After 
ligation and transformation, mini-preps of clones are checked 
for correctness by digestion with Ncol . Correct clones will 
15 be identified by the presence of both a BstXI and Ncol site. 
Treatment of this modified pDblet, with BstXI followed by 
Xhol sites generates a vector that contains a 5' Xhol site 
and a 3' CTAA BstXI overhang. See Figure 5E. This cleaved 
vector can be treated with Klenow fragment and dCTP and dTTP 
20 to render it incapable of ligating to itself. Such a vector 
may be used to accept the concatemer chains. 

In one embodiment, the invention encompasses cosmid 
vectors that contain an autonomously replicating sequence of 
S. pombe, and thus can be used to prepare combinatorial gene 
25 expression library in S. Pombe. A series of cosmid vectors 
can be constructed which comprises as least one cloning site 
for insertion of donor DNA, cos sites for in vitro packaging 
in X phage* replication origin (s) and selection markers for 
cloning in E. coll, an autonomously replicating sequence 
30 (ARS) of S. pombe, and one or more different yeast selection 
markers, such as but not limited to puromycin, ura4 r 
hygromycin or zeocin. 

Cosmid vector SuperCosl (Stratagene) was linearized with 
restriction cmdonuclease Bglll. The plasmid, purified by the 
35 geneclean procedure (BiolOl) # was treated with T4 DNA 

polymerase to "fill-in" the DNA termini. The DNA was again 
purified by the geneclean procedure. The DNA was treated 
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with T4 DNA ligase and transformed into E . coli strain DH5a . 
Clones were tested for their ability to be cut with 
restriction enzyme Bgl II. One clone that was resistant to 
Bglll was isolated for further work was called SCos-Bglll. 
5 SCos-Bglll was digested/ linearized with restriction 

endonuclease BamHI. The linearized vector was treated with 
calf-intestine alkaline phosphatase to prevent self-ligation 
and DNA was purified by the geneclean procedure. An 
artificial DNA linker containing two DraX , one Xhol, and one 
10 Bglll restriction endonuclease sites was ligated into the 
BamHI site with T4 DNA ligase. The ligation was transformed 
into E. coli strain DH5a . The resulting plasmid (pSuperCosB) 
no longer contains the BamHI restriction endonuclease 
cleavage site. 

15 pSuperCosB was linearized with restriction endonuclease 

Aatll. The linearized vector was treated with calf-intestine 
alkaline phosphatase to prevent self-ligation, and was 
purified by the geneclean procedure. Concurrently, plasmid 
pDblet was cut with restriction endonuclease Aatll. The 

2 0 digested pDblet plasmid was separated by agarose gel 

electrophoresis. A 1198 bp fragment, containing the yeast 
autonomously replicating sequence (ARS) was cut from the gel 
and purified by the geneclean procedure. The ARS-containing 
DNA fragment was then ligated in the linearized SuperCosl 

25 vector. The ligation was transformed into E. coli strain 
SC110. The resulting clone was called pPCos. 

pPCos was digested with restriction endonucleases Bglll 
and Xhol. pDblet was digested with restriction endonucleases 
BamHI and Xhol. Both digests were separated by agarose gel 

30 electrophoresis. The ura4 gene containing BamHI /Xhol 

fragment and the pPCos vector fragment were cut from the gel 
and purified by the geneclean procedure. The two DNA 
fragments were ligated together using T4 DNA ligase. The 
ligation reaction was then transformed into E. coli strain 
35 DH5a . The resulting plasmid clone was called pPCos+ura. 

The vector pPCos+ura is deposited at the Agricultural 
Research Service Culture Collection (NRRL) , Agricultural 
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Research Service, U.S. Department of Agriculture, 1815 North 
University Street, Peoria, Illinois 61604, U.S.A. on October 
24, 1996, and is given accession number B-2163 7N. 

Alternatively, pPCos can be digested with restriction 
S endonucleases Smal and Belli. This releases a DNA fragment 
containing a truncated neomycin resistance gene. The vector 
is purified from this fragment by agarose gel electrophoresis 
and the geneclean procedure. An artificial DNA linker 
containing Spel, Kpnl, and Ndel restriction endonuclease 
10 sites is ligated into the vector using T4 DNA ligase. The 
ligation is transformed into E. coli strain DH5a. The 
resulting vector is called pPCos-Neo - . 

Moreover, pPCos-Neo' can be linearized with restriction 
endonuclease Ndel . The linearized vector is treated with 
15 calf -intestine alkaline phosphatase to prevent self- ligation 
and DNA was purified by the geneclean procedure. 
Concurrently, plasmid pDblet is digested with restriction 
endonuclease Ndel which releases an approximately 1800bp 
fragment containing the ura4 gene. This fragment is 
20 separated from the rest of the vector by agarose gel 

electrophoresis and purified by the geneclean procedure. The 
ura4 gene fre.gment is ligated into the pPCos-Neo' vector 
backbone using T4 DNA ligase. The ligation is transformed in 
E. coli strain DH5a. The resulting plasmid is pPCosl (see 
25 Figure 14) . 

5.5.8. PLANT EXPRESSION T.TBRARIES 
This section describes procedures that may be generally 
applied to prepare combinatorial gene expression libraries 
30 using plant cells as donor and/or host organisms. 

For preparation of donor DNA from plants, the following 
general procedure is applied: (1) a pretreatment of the plant 
tissue in cold ether to enhance cell disruption; (2) 
mechanical homogenization of the tissue by grinding with 
35 sand, glass beads or- aluminum oxide; (3) filtration through a 
mesh to remove cell debris; and (4) extraction of the DNA by 
the procedures described in 5.1.2. The resulting purified 
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DNA is modified as described in Sections 5.5.3. The CaMV 35S 
or nopaline synthase promoter, and nopaline synthase 
terminator fragments are prepared by PCR as described in 
Section 5.5.3. The promoter and terminator fragments are 
5 attached to the DNA fragments, and ligated to a plant DNA 
vector as described in 5.5.5 and 5.5.6. 

A preferred plant DNA vector is Binl9 or its variants 
which uses T-DNA borders and trans acting functions of the 
vir region of a co-resident Ti plasmid in Agrobacterium 
10 tumefaciens to transfer the donor genetic material into the 
nuclear genome of plant host cells (Bevan 1984, supra). 
Modified Binl9 vectors containing a multiple cloning site, 
such as pBI121 or pBI22l which are commercially available 
(Clontech, Palo Alto) , can be used. Kanamycin resistance 
15 and/or /3-glucuronidase activity are used as markers for 
monitoring transformation, and for pre-screening . 

Plant protoplasts are prepared from leaves of Nicotiana 
tabacuro plants as described in Potrykus et al. 1988 in 
"Methods for Plant Molecular Biology" Weissbach and Weissbach 
20 ed. Academic Press, page 376-378. The expression constructs 
are introduced into protoplast cells by transformation using 
polyethylene glycol as described in Power et al. 1988 in 
"Methods for Plant Molecular Biology" Weissbach and Weissbach 
ed. Academic Press, page 388-391. The transformed 
25 protoplasts are selected by antibiotic resistance, e.g., 
kanamycin, and can be encapsulated for pre-screening as 
described in Section 5.4.10. 

6 EXAMPLE: CONSTRUCTION AND SCREENING OF 

COMBINATORIAL GENE EXPRESSION 

30 LIBRARY 

The following subsections describe the preparation and 
pre-screening of combinatorial gene expression libraries 
using mixtures of terrestrial microorganisms or marine 
microorganisms as donor organisms. The libraries utilize 
35 streptomyces lividans, E. coli and S. porabe as host 

organisms. The results show that some of the library cells 
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display metabolic activity of the donor organisms indicating 
that potentially interesting donor metabolic pathways are 
functional in the host organisms. In addition, it is shown 
that one library clone contains DNA encoding a marine 
S bacterial protein that shares sequence homology to a known 
enzyme in a metabolic pathway. 

6.1. MATERIALS METHODS 
Reagents useful in the present method are generally 
10 commercially available. For example: 

Gene Clean, Genome kit (BiolOl, Vista, CA) ; Restriction 
enzymes, PCR reagents, and buffers (Promega, Madison, HI; New 
England Biolabs; Stratagene, La Jolla, CA) ; TA cloning kit 
(Invitrogen, La Jolla, CA) ; Bacterial media (Difco, Inc.); 
15 Mira Tip (Hawaiian Marine Imports, Inc.); pBSK plasmid, XL1- 
MR cells, supercos 1 cosmid, Gigapack packaging extracts 
(Stratagene, La Jolla, CA) ; Qiagen QIAprep plasmid 
purification kit (Qiagen, Inc., chatworth, CA) ; avidin- 
conjugated magnetic porous glass (MPG) beads (CPG, Inc., New 
20 Jersey); petri dishes, 96- and 384-well plates, Omni-Trays 
(Nunc) , 96- end 384-pin replicator and forms (V & P 
Scientific, San Diego, CA) ; ampicillin (IBI, Inc., CA) ; green 
fluorescent protein and GFP cDNA (Clontech, Inc.); 
oligonucleotides (Genset, La Jolla, CA) ; bacterial species 
25 and DNA sequences not elsewhere designated (American Type 
culture collection, Rockville, MD) ; 7-ethoxy-heptadecyl- 
coumarin, BCi:CF-AM (Molecular Probes, Oregon); 3-methyl 
benzoate, 3-chlorotoluene, m-toluate, tetracycline, 
chloramphenicol, acetaminophen, arsenic, antimony, cis-cis- 
30 muconate, and other chemicals unless noted (Sigma) ; and 
Dynabeads, MPC-M (Dynal, Inc., Lake Success, NY). 

6.1.1. MF.DTA PRE PARATION 
Purified water (ddHjO) for general use in media and 
35 solutions is purified by softening, reverse osmosis, and 
deionization. Pacific seawater (sea H z O) is obtained from 
Scripps Institute of Oceanography (La Jolla, CA) and filtered 
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before use. Synthetic seawater (SSW) is prepared from ddH 2 o 
by the addition of salts (45.2mm NaF, 48.8mm SrCl 2 , 0.324mM 
H 3 B0 3 . 0.563mM KBr , 6.25mM KC1, 4.99mM CaCl 2 , . 7mM Na 2 S0 4 , 
16.4mM MgCl 2 , 268mM NaCl, 45.8mM Na 2 Si0 3 , l.lOmM EDTA, 1.58mM 
5 NaHC0 3 ) and marine trace elements (0.01 % Mira Tip). 

LB medium is prepared from ddH 2 0 with 1% tryptone, 0.5% 
yeast extract, 1% NaCl. W2-B1 is prepared from 75% sea H 2 0 or 
SSW with 0.25% peptone, 0.15% yeast extract, 0.6% (vol/vol) 
glycerol. 

10 F10A is prepared from ddH 2 0 containing 2.5% soluble 

potato starch, 0.2% glucose, 0.5% yeast extract, 0.5% 
peptone, 0.5% Distiller's solubles (Nutrition Products Co., 
Louisville, KY) , 0.3% calcium carbonate with pH adjusted 
to 7. 

15 

6.2. PRE- SCREENING OF ACTINOMYCETES / STREPTOMYCES 
LIVIDANS COMBINATORIAL NATURAL PATHWAY 
EXPRESSION LIBRARY BY PLATE REPLICATION AND 
MACRODROPLET ENCAPSULATI ON 

Thirty four actinomycetes species, identified as species 

20 # 501-534 were used as donor organisms. The organisms were 

cultured in F10A medium separately, and genomic DNA was 

extracted and purified as described in Section 5.3.1. 

Approximately 100 jig genomic DNA per species was 

obtained and mixed together for partial restriction digestion 

25 by Sau3A as described in Section 5.4.2. Fragments of genomic 

DNA were subjected to size fractionation by sucrose gradient 

centrifugation, and fractions containing 20-40 kb fragments 

were pooled and partially filled-in with the Klenow fragment 

so as to be compatible with similarly-prepared vectors below 

30 (Korch 1987, Nuc Acids Res 15:3199-3220; Loftus et al. 1992 

Biotechniques 12:172-175). 0.5-3.0 of the pooled 

fragments were ligated in multiple batches to 0.5-3.0 /xg of 

pIJ922 and pIJ903 (Hopwood 1985 , supra) vector prepared with 

BaroHI or Xhol. The ligated expression constructs were 

35 transformed into the host organism, streptomyces lividans, 

strain TK64 which had been made competent by removal of cell 
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walls with lysozyme (Hopwood 1985 , supra). Approximately 
11,000 unique clones were generated, amplified and stored as 
mycelia in 20% glycerol and as spore suspensions in 50% 
glycerol at -70 °C. 
5 To prepare the libraries for screening individual 

clones, the transformed TK64 host cells were spread on 150mm 
Petri dishes filled with F10A agar. After spreading, the 
plates were allowed to incubate for 21 hours at 3 0°C. A 
selection was performed by overlaying plates with 
10 thiostrepton at 5 tig /ml, l ml/plate. After 48-72 hours, 

colonies were picked with sterile toothpicks and transferred 
one per well to 96-well plates. Each well contained F10A 
media. These inoculated master plates were placed at 3 0 °C 
for 1-4 days. The overnight master 96-well plates were used 
15 as source plates to replicate into one or more working 96- 
well plates or Omni-Trays. The master 96-well plates were 
then sealed individually and frozen at -80 °C. Replication 
was done with a 96-pin replicator which was sterilized by 
flaming before each use. 
20 Working 96-well plates were used as source plates to 

replicate the library onto a series of differential and/ or 
selective media and indicator plates. Selective antibiotics 
included erythromycin, novobiocin and neomycin. Differential 
media included F10A and R5 medium containing substrates X- 
25 glucopyranosi.de and X-gluconic acid. Indicator plates 

included library clones grown on F10A then overlaid with a 
indicator lawn of Enterococus faecalls (E. faecalis) , 
Bacillus subtills (B. subtilis) or SOS Chromotest (with X- 
gal) . The results are compiled and compared to the profiles 
30 of Streptomyces host TK64. 

The clones of the library are also pre-screened by 
macrodroplet encapsulation. For each pre-screen, 50,000 
amplified clones of the library are encapsulated by the 
method as described in Section 5.4.13. 

35 
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6.3. PRE-SCREENING OF ACTINOMYCETES/E . COLI 

COMBINATORIAL CHIMERIC PATHWAY EXPRESSION 
LIBRARY BY MACRODROPLET EN CAPSULATION 

Genomic DNA obtained from the thirty four actinomycetes 

species (identified as species # 501-534) as described in 

5 Section 6.2, were used in the preparation of a combinatorial 

chimeric pathway gene expression library in a S. lividans 

host. Fractions containing fragments of Sau3A-digested 

genomic DNA of 2-7 kb were pooled. 

Aliquots of the genomic DNA fragments are ligated to the 

10 different promoters separately to form gene cassettes as 

described in Section 5.5.3. The concatemers are formed by 8 
cycles of ligation and deprotection using a different pool of 
gene cassettes for each cycle, such that the resultant 
concatemers each have 8 gene cassettes comprising 8 different 

15 promoters attached to fragments of genomic DNA. 

Ten micrograms of the concatemers were circularized and 
ligated to 0,5 |ig of SuperCos 1 vectors at the BamHl site to 
form expression constructs, which were packaged in vitro for 
infection of the E. coli host cells XL1-MR according to the 

20 manufacturer's directions (Stratagene) . Approximately 

1,000,000 of unique clones are obtained, amplified and pooled 
to form an amplified library. The library was stored at - 
70 °C. Amplified cells are encapsulated as in Section 5.4.10, 
and pre-screened as in 5.4.14. 

25 

6.4. PRE-SCREENING OF FUNGAL / S CHIZOSACCHAROMYC ES 

POMBE COMBINATORIAL CHIMERIC PATHWAY 
EXPRESSION LIBRARIES BY MACRODROPLET 

encapsulation 

Two combinatorial chimeric pathway expression libraries 
were prepared using the following fungal donor organisms 
obtained from ATCC: Trichoderma reesei, Fusarium oxysporum, 
Penicillium roquefortli, Rhizopus oligosporus, Neurospora 
crassa, Phycomyces blankesleeanus, Aspergillus fumigatus, 
Aspergillus i'lavus, Emericella heterothallica, Chaetomium 
35 gracile, Penicillium notatum, Penicillium chrysogenum. 



30 
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Each species was cultured separately in 500 ml potato 
dextrose agar (PDA; Difco) or malt extract agar (MEA; Difco) 
at medium rpm for 48-72 hours. Spore inoculations of lxlO 4 - 
lxio 6 spores per ml were placed into 500 ml of potato extract 
5 or malt extract broths in 1 liter culture flasks and grown at 
22 C, 225 rpm, 48-72 hours. 

Cultures were harvested by filtration through Miracloth 
(Calbiochem) under vacuum. The collected mycelial masses were 
washed with 2 litres of ddH 2 o, and air-dried for 10 minutes 
10 before freeze drying. Fungal genomic DNA and mRNA were 
extracted and purified from the mycelia as described in 
Sections 5.3.1 and 5.3.2. A portion of the harvested mycelia 
were freeze-dried and stored at -70 °C. 

Fungal genomic DNA fragments were prepared as described 
15 in Sections 5.4.2. Fungal mRNA was converted into cDNA 

according to standard methods. (Sambrook et al. 1989, Watson 
CJ & Jackson JF (1985) DNA cloning: A practical approach 79- 
88 , IRL Press) . Weight equivalents of DNA fragments from 
each species were pooled to yield a genomic DNA pool and a 
20 cDNA pool. 

Each of these pools containing approximately 5-10 ng of 
DNA is used independently to assemble a combinatorial 
chimeric pathway expression library. The following S. 
pombe-compatible promoters and terminators were generated as 
25 described in Section 5.5.2: CMV immediate/early , SV40 

early, RSV, HSV thymidine kinase, CaMV, nmtl, adhl and uva4 
promoters. The promoter and terminator fragments are 
combined with, the cDNA and genomic DNA pools as described in 
Sections 5.5.4. Each gene cassette averaging 5 kb in length 
30 is concatenated as described in Section 5.5.5. The final 

concatemers containing 8 gene cassettes each are circularized 
and inserted into the vector modified pDblet (Brun et al. 
1995, Gene, Vol. 164 pp. 173-177) as described in Section 
5.5.7. The expression constructs were transformed into S. 
35 pombe cells via lithium acetate method of Gietz and Woody (FD 
Gietz & RA Woody, Molecular genetics of yeast: A practical 
approach, chapter 8, pp 121-134). Upon selection for 
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presence of the ura4 marker, 110 f 000 S. pombe clones are 
obtained and amplified. The clones are pooled to form an 
amplified library ready for pre-screening • The following 
pre-screens are performed: enzyme substrate test, anti- 
5 microbial activity, antibiotic resistance. 

6.5. ^RE-SCREENING OF MARINE GRAM (-)/£. COLI 
LIBRARY BY PLATE flKPT.T CATION 

Marine bacteria obtained from seawater collected near 
the Bahamas Islands were provided by the Harbor Branch 
Oceanographic Institute. Each of the wild-type gram-negative 
pigmented marine bacterial species was tested prior to 
preparation of the DNA libraries to determine redundancy, and 
to help determine the array of pre-screens to be done on the 
completed libraries. 

The following assays were performed on the parental 
species of marine gram-negative/*, coll library, with the 
indicated results: 



10 



15 



20 



25 



Table V: 

Assav Positive species rmt of 37 species 

Chromazurol S (CAS) 27 

Streptococcus pyogenes 0 

Enterococcus faecal is 3 

Proteus mirabils * 

Sarclna aurantiaca ±° 

Staphylococcus aureus 6 

Starch digestion 17 



Of these assays, the following were selected to be 
performed on the cells of the combinatorial gene expression 
library in E. coll : CAS; S. aureus/ S. aurantiaca; starch 
digestion. 

30 Briefly, each of the 40 parental species was inoculated 

into 5 ml of B3 medium and cultured overnight at 30°C, 300 
rpm in Falcon 2059 tubes. the overnight cultures were 
pelleted and the total genomic DNA extracted by standard 
procedures. Genomic DNA was quantified by visualization on 

35 an agarose gel and 5 ng DNA from each of the 40 species was 
contributed to a pool totaling 200 M9- The combinatorial 
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natural pathway expression libraries were assembled in E. 
coll as described in Section 5.1.4. This DNA was partially 
digested, ligated to SuperCosl and packaged in X phage for 
introduction into E. coll according to the SuperCosl 
5 manufacturer's directions (Stratagene) . This resulted in 5 x 
10 6 unique clones, which was amplified to 7 x 10 8 /ml cfu by 
standard protocols. The amplified stock was stored in 15% 
glycerol at -70°C for subsequent use. 

To prepare the libraries for screening individual 
10 clones, the amplified library cells were spread on 150mm 
Petri dishes with 50ml LB, lOOmg/ml ampicillin and 50mg/ml 
kanamycin. The plates were previously dried for 24 hours at 
ambient temperature in the dark. The 7 X loV^l cfu stock was 
diluted in LB to 500 cfu/ml. One ml was spread on each 150mm 
IS plate. After spreading, the plates were allowed to incubate 
overnight at 37 °C. Resulting colonies were picked with 
sterile toothpicks and transferred one per well to 384-well 
plates. 640C colonies were picked and archived. Each well 
contained 7 5 Hi LB, 50 /ig/ml ampicillin, 7% glycerol. The 
20 outer rows (6 0 wells total) were not inoculated but were 
similarly filled with medium to provide an evaporation 
barrier during subsequent incubation and freezing. These 
inoculated master plates were placed at 37 °C for 16 hours 
without shaking. The overnight master 384-well plates are 
25 used as a source plate to replicate into one or more working 
multi-well plates or omni-Trays. The master 384-well plates 
were then sealed individually and frozen at -80«>C. 
Replication was done with a multi-pin replicator. Before and 
after each use, the 384-pin replicator was dipped 
30 sequentially into bleach for 20 seconds, water for 30 
seconds, then ethanol for 5 seconds before flaming. 

Working multi-well plates or Omni-Trays were used as 
source plates; to replicate the DNA libraries onto a series of 
differential and/or selective media (e.g. siderophore 
35 detection media (CAS) or antimicrobial lawns) . The results 
were compiled and compared to the profiles of the wild-type 
marine bacteria used to construct the DNA library. 
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Six clones were isolated that were positive for starch 
digestion ability. These clones were tested for the ability 
to inhibit growth of S. aureus or S. aurantiaca, and one 
clone was found to inhibit the growth of S. aurantiaca. This 
5 clone was subjected to further analysis, including DNA 
sequence analysis, and was found to contain DNA sequences 
encoding proteins homologous to those in a polyketide 
synthesis pathway. Figure 10 shows the alignment of the 
predicted amino acid sequence of a DNA sequence derived from 
10 clone CXC-AMN20 with the actinorhodin dehydrase gene of 
Streptomyces coeli col or . 

The active component from this clone is further analyzed 
by extraction with organic solvents and purification guided 
by anti-microbial assays. 
15 The DNA sequence contained in this clone was further 

examined by multiplex PCR to determine the cognate parental 
species. PCR primers were selected and synthesized based on 
sequence of the clone. Highly conserved ribosomal RNA primer 
sequences were used in the PCR as positive control. The 
20 positive control generates a fragment of approximately 2 kb. 
The amplicon generated from the clone or its cognate parental 
species was less than 600 bp. Initially, mulitplex PCR 
reactions were performed by standard method using a set of 
four pools of genomic DNA of the parental species. Genomic 
25 DNA from Pool 1-3 produced the amplicon upon amplification. 
See Figure 11. The multiplex PCR reactions were repeated 
with genomic DNA of individual parental species. Figure 12 
shows that genomic DNA derived from species #6 from Pool 1, 
species #18 from Pool 2 and species #31 from Pool 3 were 
30 positive in the PCR reaction. This suggested that the 

identified DNA sequence was likely derived from any of these 
3 species of marine bacteria. 

Thus, the results show that the combinatorial gene 
expression library contains clones carrying genetic material 
35 derived from marine bacteria that encodes metabolic pathway 
of interest. Furthermore, it is shown that such clones in 
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10 



15 



the library can be identified, and isolated by pre-screening. 

6.6. PRE-SCREENING OF MARINE GRAM ( - ) /B . COLI 

LIBRARY BY MACRODROPLET ENCAPSULATION 

30,000 clones were encapsulated by taking sodium 

alginate (Prctanol LF 20/60, Pronova Biopolymer, Drammer, 

Norway) and dissolving it in 100 mL of sterile water at a 

concentration of 1% using an overhead mixer at 2000 rpm. 

One ml of library suspension containing 30,000 cells was 

added so as to embed 1-5 clones per droplet. The mixture was 

allowed to sit for 30 minutes to degas. The mixture was then 

extruded through a 25 gauge needle. These fluids were 

dropped into an 0.5L gently stirring beaker of 13 5mM calcium 

chloride. Droplets were allowed to harden for 10 minutes and 

then were transferred to a sterile flask and the calcium 

chloride removed and replaced with LB/Amp media and a 

substrate, X-glucosaminide, at 80 iiq/ml. Other substrates 

were X-acetat.e, X-glucopyranoside, X-gal and specific custom 

substrates relevant to polyketide pathways. Flasks 

containing the droplets were then shaken at 30 °C overnight 

and examined the following morning for positive clones 

indicated by the presence of blue colonies. Clones are also 

co-encapsulated with indicator cells as described in 5.4.14. 

Indicator cells include S. aureus, S. aurantiaca. 

Droplets were placed in a single layer in a large clear 

tray and scanned by eye. One X-glucosaminide positive was 

recovered, resuspended in 15% glycerol and stored at -70°C. 

Other positive colonies are removed and placed in 96-well 

master plates containing LB/Amp and 50 mM sodium citrate pH 

7.4 to dissolve the matrix, and allowed to grow at 37 °C 

overnight. These overnight master 96-well plates are used as 

a source plate to replicate into one or more working 

multi-well plates or Omni-Trays. The master 9 6-well plates 

are then sealed individually and frozen at -80 °C. Positive 

clones are either sent for specific testing of the products 

or sent through another round of pre-screening or screening. 
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30 
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Further screening is performed by replication which is done 
with a multi-pin replicator. 

7 EXAMPLE: CONSTRUCTION AND SCREENING °* 

7. tAAm-i. ACTINOMYCETES / STREPTOMYCES LIVIDANS 

5 COMBINATORIAL GENE EXPRESSION 

LIBRARY 

The following subsections describe the preparation of a 
combinatorial natural pathway expression library and a biased 
combinatorial gene expression library using a mixture of 
10 terrestrial microorganisms as donor organisms. Both 
libraries utilized B. coli as an archival host, and 
Streptomyces lividans as the expression host. 

Briefly, to make the combinatorial natural pathway 
expression library, an archival cosmid library was made with 
15 the DNA of donor organisms. The inserts of the library was 
then isolated and recloned into streptomyces vectors for 
introduction into the host, Streptomyces lividans. To make 
the biased combinatorial gene expression library, the 
archival library was pre-selected by hybridization with genes 
20 encoding type II polyketide biosynthetic pathways. Genetic 
material potentially encoding type II polyketide pathways 
from the actinomycete donor organisms were isolated, randomly 
mixed, and re-cloned into S. lividans to form the 
combinatorial expression library. 
25 The clones from both libraries were analysed for 

antibiotic activity that are potentially the products of 
novel or hybrid metabolic pathway that are functional in the 
host organisms. 

30 7.1. m&TF.RTATjS METHODS 

According to the invention, an archival library was 
constructed in the cosmid P WE15 and E. coli using the 
chromosomal DNA isolated from thirty-four donor actinomycete 
strains . 

35 To make the combinatorial natural pathway gene 

expression library, cosmid DNA was prepared from clones of 
the archival cosmid library, and were pooled. Due to the 
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fact that actinomycete donor DNA has a high GC content, an 
enzyme Dral (that does not have G or C in its recognition 
site) was used to isolate the actinomycete donor DNA from the 
pooled cosmid DNA. DNA fragemnts greater than 25 kb in size 
S were enriched by sucrose gradient centrifugation, and ligated 
to linkers compatible to the cloning sites of the expression 
vectors. The following Streptomycete expression vectors were 
used to form the library: pIJ941, pIJ702, pIJ699, pIJ922 and 
pIJ903 (Hopwood et al. , 1985 Genetic Manipulation of 
10 streptomyces, A Laboratory Manual, The John Innes 

Foundation) . The combinatorial natural pathway expression 
library was then introduced into the expression host S. 
lividans TK64. Eight thousand and two hundred (8,200) clones 
were picked, cultured separately, and analysed for the 
15 ability to inhibit growth of Micrococcus lutens, 

Staphylococcus aureus, Bacillus subtilis, Escherichia coli, 
Saccharomyces cerevisia&, Candida albicans, and Penicilliuro 
chrysogenum. The clones were also tested for DNA reactivity 
by the SOS test. 
20 To make the biased combinatorial gene expression 

library, the archival cosmid library was plated out on 150 mm 
petri dishes to a density of approximately 2,000 colonies per 
plate. A total of 60,000 colonies were screened. The plates 
were incubated for 18 hours at 30 °C, in order to produce 
25 colonies that, were approximately 0.5 mm in diameter. These 
colonies were then replicated to 137 mm, 0.45 (im Nytran 
discs. The discs were then placed on sheets of 3 MM paper 
pre-soaked in 10 % SDS with the colony side up, and left to 
soak for 3 minutes. These discs were transferred to sheets 
30 of 3 MM paper, presoaked in 0.5 M NaOH, 1 M NaCl for 5 

minutes; then to 3 MM paper presoaked in 0.5 M Tris-HCl, 1.5 M 
NaCl for another 5 minutes; and finally to 3 MM paper 
presoaked in 2xSSC for 5 minutes. The discs were then left 
to dry in air for at least 6 hours. The DNA was then fixed 
35 to the filter by UV crosslinking. DNA probes specific to 
actl (Malpartida et al., 1984, Nature 309:462-464) and whiE 
(Davis et al. , 1989, Molecular Microbiology 4:1679-1691) 
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polyketide pathway were labelled using the non-radioactive 
DIG labelling kit and hybridized to the filter discs at 60°C 
in a solution of 5xSSC/0.1 % SDS overnight. After the 
incubation, the discs were washed at 60°C in lxSSC/0.1% SDS 
5 for one to four times, and the colonies which had hybridized 
with the probes were detected. 

About 60-70 positive clones were isolated from the 
original plates, and pooled. Cosmid DNA was isolated from 
the pooled clones, and digested with Dral to separate the 
10 cosmid vector DNA from the actinomycete DNA. In order to 
randomly mix the genes in the metabolic pathways, the 
actinomycete DNA was partially digested with Sau3AI to 
generate fragemnts in the range of approximately 4-10 kb, anc 
the fragments were ligated to form concatamers of greater 
15 than 50 kb. The ligated DNAs were redigested partially with 
Sau3AI to generate fragments with an approximate size range 
of 15-3 0 kb, which were ligated into the Bglll site of the 
vector pIJ702 (Hopwood et al., 1985 Genetic Manipulation of 
Streptomyces , A Laboratory Manual, The John Innes 
2 0 Foundation) . The biased combinatorial expression library wai 
then intorduced into the expression host S. lividans TK64. 
Two thousand and two hundred (2,200) clones were picked, 
cultured separately, and analysed for the ability to inhibit 
growth of Micrococcus lutens (MLUT) , Staphylococcus aureus 
25 (SA1) , Bacillus subtilis (BS8) , Escherichia coll (E. coli) , 
Saccharomyces cerevisiae (SC7) , Candida albicans (CA917), an 
Penicillium chrysogenum (PC) . The clones were also tested 
for DNA reactivity by the SOS test (SOS). 

30 7.2. RESULTS 

For the combinatorial natural pathway expression 
library, 8,200 clones were screened for antimicrobial 
activities, and 205 clones (2.5%) of interests were 
identified. Table VI shows the antimicrobial assay results 

35 of 20 clones from the combinatorial natural pathway 
expression library. 
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Further screening is performed by replication which is done 
with a multi-pin replicator, 

7, EXAMPLE: CONSTRUCTION AND SCREENING OF 

5 ACTINOMYCETES/ STREPTOMYCES LIVIDANS 

COMBINATORIAL GENE EXPRESSION 
LIBRARY . 

The following subsections describe the preparation of a 

combinatorial natural pathway expression library and a biased 

combinatorial gene expression library using a mixture of 

10 terrestrial microorganisms as donor organisms. Both 
libraries utilized E. coli as an archival host, and 
Streptomyces lividans as the expression host. 

Briefly , to make the combinatorial natural pathway 
expression library, an archival cosmid library was made with 

15 the DNA of donor organisms. The inserts of the library was 
then isolated and recloned into Streptomyces vectors for 
introduction into the host, Streptomyces lividans. To make 
the biased combinatorial gene expression library, the 
archival library was pre-selected by hybridization with genes 

20 encoding type II polyketide biosynthetic pathways. Genetic 
material potentially encoding type II polyketide pathways 
from the actinomycete donor organisms were isolated, randomly 
mixed, and re-cloned into S. lividans to form the 
combinatorial expression library. 

25 The clones from both libraries were analysed for 

antibiotic activity that are potentially the products of 
novel or hybrid metabolic pathway that are functional in the 
host organisms. 

30 7.1. MATERIALS AND METHODS 

According to the invention, an archival library was 
constructed in the cosmid pWE15 and E. coli using the 
chromosomal DNA isolated from thirty-four donor actinomycete 
strains. 

35 To make the combinatorial natural pathway gene 

expression library, cosmid DNA was prepared from clones of 
the archival cosmid library, and were pooled. Due to the 
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fact that actinomycete donor DNA has a high GC content, an 
enzyme Dral (that does not have G or C in its recognition 
site) was used to isolate the actinomycete donor DNA from the 
pooled cosmid DNA. DNA fragemnts greater than 25 kb in size 
5 were enriched by sucrose gradient centrifugation, and ligated 
to linkers compatible to the cloning sites of the expression 
vectors. The following Streptomycete expression vectors were 
used to form the library: pIJ941, plJ702, pIJ699, pIJ922 and 
PIJ903 (Hopwood et al., 1985 Genetic Manipulation of 

10 Streptomyces, A Laboratory Manual, The John Innes 

Foundation) . The combinatorial natural pathway expression 
library was then introduced into the expression host S. 
lividans TK64 . Eight thousand and two hundred (8,200) clones 
were picked, cultured separately, and analysed for the 

15 ability to inhibit growth of Micrococcus lutens. 

Staphylococcus aureus r Bacillus subtilis, Escherichia coli f 
SaccharomycBs cerevisiao, Candida albicans, and Penicillium 
chrysogenum. The clones were also tested for DNA reactivity 
by the SOS test. 

20 To make the biased combinatorial gene expression 

library, the archival cosmid library was plated out on 150 mm 
petri dishes to a density of approximately 2,000 colonies per 
plate. A total of 60 , 000 colonies were screened. The plates 
were incubat€td for 18 hours at 30°C, in order to produce 

25 colonies that, were approximately 0.5 mm in diameter. These 
colonies were then replicated to 137 mm, 0.45 Nytran 
discs. The discs were then placed on sheets of 3 MM paper 
pre-soaked in 10 % SDS with the colony side up, and left to 
soak for 3 minutes. These discs were transferred to sheets 

30 of 3 MM paper, presoaked in 0.5 M NaOH, 1 M NaCl for 5 

minutes; then to 3 MM paper presoaked in 0.5 M Tris-HCl, 1.5 M 
NaCl for another 5 minutes; and finally to 3 MM paper 
presoaked in 2xSSC for 5 minutes. The discs were then left 
to dry in air for at least 6 hours. The DNA was then fixed 

35 to the filter by UV crosslinking. DNA probes specific to 
actl (Malpartida et al., 1984, Nature 309:462-464) and whiE 
(Davis et al. , 1989, Molecular Microbiology 4:1679-1691) 
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polyketide pathway were labelled using the non-radioactive 
DIG labelling kit and hybridized to the filter discs at 60 °C 
in a solution of 5xSSC/0.1 % SDS overnight. After the 
incubation, the discs were washed at 60°C in lxSSC/0.1% SDS 
5 for one to four times, and the colonies which had hybridized 
with the probes were detected. 

About 60-70 positive clones were isolated from the 
original plates, and pooled. Cosmid DNA was isolated from 
the pooled clones, and digested with Dral to separate the 

10 cosmid vector DNA from the actinomycete DNA. In order to 
randomly mix the genes in the metabolic pathways, the 
actinomycete DNA was partially digested with Sau3AI to 
generate fragemnts in the range of approximately 4-10 kb, and 
the fragments were ligated to form concatamers of greater 

15 than 50 kb. The ligated DNAs were redigested partially with 
Sau3AI to generate fragments with an approximate size range 
of 15-30 kb, which were ligated into the Bglll site of the 
vector pIJ702 (Hopwood et al., 1985 Genetic Manipulation of 
Streptomyces , A Laboratory Manual, The John Innes 

20 Foundation) . The biased combinatorial expression library was 
then intorduced into the expression host S. llvldans TK64. 
Two thousand and two hundred (2,200) clones were picked, 
cultured separately, and analysed for the ability to inhibit 
growth of Micrococcus lut&ns (MLUT) , Staphylococcus aureus 

25 (SA1) , Bacillus subtilis (BS8) , Escherichia coli (E. coli) , 
Saccharomyces cerevisiae (SC7) , Candida albicans (CA917) , and 
Penicillium chrysogenum (PC) . The clones were also tested 
for DNA reactivity by the SOS test (SOS) . 

30 7.2. RESULTS 

For the combinatorial natural pathway expression 
library, 8,200 clones were screened for antimicrobial 
activities, and 205 clones (2.5%) of interests were 
identified. Table VI shows the antimicrobial assay results 

35 of 20 clones from the combinatorial natural pathway 
expression library . 
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For the biased combinatorial gene expression library, 
2,200 clones were screened for antimicrobial activities, and 
71 clones (3.2%) of interests were identified. Table VII 
shows the antimicrobial assay results of nine clones from the 
5 combinatorial natural pathway expression library, which will 
be subjected to chemical structural analysis. 
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The data show that both the combinatorial natural 
pathway expression library and the biased combinatorial gene 
expression library contain clones carrying genetic material 
that encodes metabolic pathways, possibly novel or hybrid 
5 polyketide pathways, that results in the production of 
compounds with antimicrobial activities. 

Having thus disclosed exemplary embodiments of the 
present invention, it should be noted by those skilled in the 

10 art that the disclosures are exemplary only and that various 
other alternatives, adaptations, and modifications may be 
made within the scope of the present invention. Accordingly, 
the present invention is not limited to the specific 
embodiments as illustrated herein, but is only limited by the 

15 following claims. 



20 



25 



30 



35 



- 124 



WO 98/17811 



PCT/US97/19958 



MICROORGANISMS 




Optional Sheet in connection with the microoraenism referred to on osae 36 end 107 . 


lines 35 end 3 of the 


description ' 




A. IDENTIFICATION OF DEPOSIT 1 




Further deposits ere identified on en additional sheet ' 




Name of depositary institution * 




Agricultural Research Culture Collection (NRRL) 
International Depositary Authority 




Address of depositary institution (including postal code and country) 4 




1816 N. University Street 

Peoria, IL 61604 

US 




Date of deposit ' October 24. 1 996 Accession Number • B-21637N 


B. ADDITIONAL INDICATIONS • (leave blank If not applicable). This information b continued on a tcpsraie attached sheet 




C. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE ' or a.**—-. 


MatKMpattJSW*) 




D. SEPARATE FURNISHING OF INDICATIONS • (lave blank tf not applicable) 


The Indications listed below will be submitted to the International Bureau later ' (Specify the oeneral nati 
"Accession Number of Deposit") 


ire of the indications e.g., 


E. □ This sheet was received with the International application when filed (to be che 


eked by the receiving Office) 


(Authorized Officer) 




□ The date of receipt (from the applicant) by the International Bureau " 




was 

(Authorized Officer) 




Form PCT/RO/ 134 (January \ SB \) 




-125- 





WO 98/17811 PCT/US97/19958 
WHAT IS CLATMEH TR» 

Xm A roobilizable combinatorial gene expression 

5 library, comprising a pool of expression constructs, each 
expression construct comprising a shuttle vector capable of 
replicating in different species or strains of host cell, 
said shuttle vector containing a cDNA or genomic DNA fragment 

10 derived from a plurality of species of donor organisms, 
wherein the cDNA or genomic DNA fragment is operably- 
associated with one or more regulatory regions that drives 
expression of genes encoded by the cDNA or genomic DNA 

15 fragment in an appropriate host organism, 

2. The gene expression library of claim 1 wherein 

^ the cDNA or genomic DNA fragments contained in the expression 
constructs axe randomly concatenated, and are derived from 
one or more species of donor organisms. 

25 3# The 9©ne expression library of claim 1 wherein 

some of the cDNA or genomic DNA fragments contained in the 
expression constructs are preselected for a specific 
property . 

30 

4. The gene expression library of Claim 1, 2 or 3 

in which the expression construct comprises a plasmid vector, 
a phage, a viral vector, a cosmid vector, or an artificial 

35 

chromosome . 
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5. The gene expression library of Claim 1, 2 or 3 
in which the shuttle vector further comprises an origin of 
transfer . 

5 

6. The gene expression library of Claim 1, 2 or 3 
in which the donor organisms comprise a mixture of 
microorganisms . 

10 

7. The gene expression library of Claim 1, 2 or 3 
in which each expression construct is contained in a host 
cell. 

15 

8. The gene expression library of Claim 7 in which 
the host cells have been modified by the introduction, 

20 induction or overproduction of a known metabolic pathway of 
interest or portion thereof. 



9. The gene expression library of Claim 7 in which 
25 the host cell is Escherichia coli, Bacillus subtilis, 

Streptomyces lividans, Streptomyces coelicolor, Pseudomonas 
aeruginosa, Myxococcus xanthus, Saccharomyces cerevisiae, 
Schizosaccharomyces pombe, Spodoptera frugiperda, Aspergillus 
30 nidulans, Arabidopsis thaliana f Nicotiana tabacum, COS cells, 
293 cells, VKRO cells, NIH/3T3 cells, or CHO cells. 

10. The gene expression library of Claim 7 in which 

35 

the host cells further contain a reporter regimen tailored to 
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identify clones in the library that are expressing desirable 
metabolic pathways or compounds. 



11. The gene expression library of Claim 7 in which 

the reporter regimen comprises DNA encoding a reporter gene 
operably-asscciated with a regulatory region that is 
inducible or modulated by the desirable metabolic pathways or 
compounds expressed by the host cell. 



12. The gene expression library of Claim 7 in which 

^ the host cells are in a matrix containing a reporter regimen 
tailored to identify clones in the library that are 
expressing desirable metabolic pathways or compounds. 

20 13 ' A met **od for making a mobilizable combinatorial 

gene expression library, comprising ligating a shuttle 
vector, capable of replicating in different species or 
strains of host cell, to cDNA or genomic DNA fragments to 

25 form expression constructs, wherein said cDNA or genomic DNA 
fragments are obtained from a plurality of species of donor 
organisms, and wherein the genes contained in the cDNA or 
genomic DNA fragments are operably-associated with their 

30 native or exogenous regulatory regions which drive expression 
of the genes in an appropriate host cell. 

35 14 ■ The method of claim 13 wherein the cDNA or 

genomic DNA fragments contained in the expression constructs 
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are randomly concatenated, and are derived from one or more 
species of donor organisms. 



5 



15. 



The method of claim 13 wherein some of the cDNA 



or genomic DNA fragments contained in the expression 



constructs are preselected for a specific property. 



10 



16. 



♦The method of Claim 13, 14 or 15 in which the 



DNA vector is a plasmid vector, a phage, a viral vector, a 
cosmid vector, or an artificial chromosome. 

15 

17. The method of Claim 13, 14 or 15 in which the 

shuttle vector further comprises an origin of transfer. 



expression library comprising transferring a pool of 
expression constructs in a species of host organism to 
another species or strain of host organism, said expression 

25 construct comprising a shuttle vector capable of replicating 
in different species or strains of host cell, said shuttle 
vector comprising cDNA or genomic DNA fragments obtained from 
a plurality of species of donor organisms, wherein the genes 

30 contained in the cDNA or genomic DNA fragments are oper ably- 
associated with their native or exogenous regulatory regions 
which drive expression of the genes in an appropriate host 
cell. 



20 



18. 



A method for making a combinatorial gene 



35 
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19. The method of claim 18 wherein the pool of 

expression constructs is transferred by conjugation. 



20. The method of claim 18 wherein the pool of 

expression constructs is transferred by isolating the 
expression constructs from a first species of host organism, 
and introducing the expression constructs into a second 
species or strain of host organism. 



21. The method of claim 20 wherein the expression 

^ constructs are introduced into a second species or strain of 
organisms by 'transformation, transf ection, infection or 
electroporati on . 

20 22 * A COSIn i d vector comprising an autonomously 

replicating sequence of Schizosaccharomyces pombe. 

23. The cosmid vector of claim 22, which is 

25 pPCos+ura. 



24. The cosmid vector of claim 22, which is pPCosl. 

25. The gene expression library of claim 3 wherein 
the cDNA or genomic DNA fragments are preselected for 
homology to nucleic acid sequences encoding proteins in a 
metabolic pathway. 
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26. The method of claim 15 wherein the cDNA or 

genomic DNA fragments are preselected for homology to nucleic 
acid sequences encoding proteins in a metabolic pathway. 

5 
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